Skip to content

Instantly share code, notes, and snippets.

View xingjinglu's full-sized avatar

xingjinglu xingjinglu

View GitHub Profile
@simon-mo
simon-mo / cudnn7.global.kernel
Last active August 15, 2022 02:01
All cuDNN compute kernels
STT_FUNC STB_GLOBAL STO_ENTRY cudnn_maxwell_gcgemm_32x32_cc
STT_FUNC STB_GLOBAL STO_ENTRY cudnn_maxwell_gcgemm_32x32_cc_batched
STT_FUNC STB_GLOBAL STO_ENTRY cudnn_maxwell_gcgemm_32x32_cn
STT_FUNC STB_GLOBAL STO_ENTRY cudnn_maxwell_gcgemm_32x32_cn_batched
STT_FUNC STB_GLOBAL STO_ENTRY cudnn_maxwell_gcgemm_32x32_ct
STT_FUNC STB_GLOBAL STO_ENTRY cudnn_maxwell_gcgemm_32x32_ct_batched
STT_FUNC STB_GLOBAL STO_ENTRY cudnn_maxwell_gcgemm_32x32_lower_cn
STT_FUNC STB_GLOBAL STO_ENTRY cudnn_maxwell_gcgemm_32x32_lower_cn_batched
STT_FUNC STB_GLOBAL STO_ENTRY cudnn_maxwell_gcgemm_32x32_lower_nc
STT_FUNC STB_GLOBAL STO_ENTRY cudnn_maxwell_gcgemm_32x32_lower_nc_batched
// Copyright (c) 2018 Gang Liao <gangliao@cs.umd.edu>. All Rights Reserved.
// Licensed under the Apache License, Version 2.0 (the "License");
// you may not use this file except in compliance with the License.
// You may obtain a copy of the License at
// http://www.apache.org/licenses/LICENSE-2.0
// Unless required by applicable law or agreed to in writing, software
// distributed under the License is distributed on an "AS IS" BASIS,
@gangliao
gangliao / plot_bench_1.py
Last active May 24, 2024 10:53
Horovod with TCP and IB
import matplotlib.pyplot as plt
#for plotting
import numpy as np
# create plot
fig, ax = plt.subplots()
bar_width = 0.15
opacity = 0.8
xlabel= np.array([8, 16, 32, 64])
@Mistobaan
Mistobaan / TENSORFLOW_DEBUG.md
Last active June 29, 2023 06:32
Tensorflow Internals Debugging Techniques

Machine Setup August 2016

Linux Ubuntu 2016.

  • 1080 GTX
  • SDK 8.0
  • CuDNN 5.1

ENABLE Core dumps

ulimit -c unlimited
@kevinhughes27
kevinhughes27 / Makefile
Created April 4, 2013 15:51
g++ Makefile for OpenCV Project
CC = g++
CFLAGS = -g -Wall
SRCS = HelloWorld.cpp
PROG = HelloWorld
OPENCV = `pkg-config opencv --cflags --libs`
LIBS = $(OPENCV)
$(PROG):$(SRCS)
$(CC) $(CFLAGS) -o $(PROG) $(SRCS) $(LIBS)