Skip to content

Instantly share code, notes, and snippets.

@petro-rudenko
petro-rudenko / CUDA_Compilers.md
Created December 4, 2019 19:10 — forked from ax3l/CUDA_Compilers.md
CUDA Compilers

In general, check the crt/host_config.h file to find out which versions are supported. Sometimes it is possible to hack the requirements there to get some newer versions working, too :)

Thrust version can be found in $CUDA_ROOT/include/thrust/version.h.

Release notes for CUDA:

@petro-rudenko
petro-rudenko / RuntimeUtils.scala
Created October 14, 2019 08:44 — forked from jvican/RuntimeUtils.scala
Some Scala code that uses Java APIs present in tools.jar (only JDKs) to programmatically produce a jstack-like thread dump. Useful to debug application and test deadlocks.
object RuntimeUtils {
def requestThreadDump: String = {
// Get the PID of the current JVM process
val selfName = java.lang.management.ManagementFactory.getRuntimeMXBean().getName()
val selfPid = selfName.substring(0, selfName.indexOf('@'))
// Attach to the VM
import com.sun.tools.attach.VirtualMachine
import sun.tools.attach.HotSpotVirtualMachine;
val vm = VirtualMachine.attach(selfPid);
@petro-rudenko
petro-rudenko / 14_12_1220_syndrome_list.log
Created October 26, 2018 09:45 — forked from lukego/14_12_1220_syndrome_list.log
Mellanox error syndrome lists
BAD_RES_STATE | 0x25B161 | destroy_ctx - context doesn't exist or doesn't match type
BAD_RES_STATE | 0x4A6FC9 | destroy_ctx - context in use
BAD_RES_STATE | 0x60DA55 | destroy_dct - dct not in drained state
BAD_PARAM | 0x67A6F2 | slrg doesnt support write;
BAD_PARAM | 0x0F0E35 | ppamp doesnt support write;
BAD_PKT | 0x4A22F | access reg MAD with specified register id not supported
BAD_PKT | 0x16C592 | mad_ifc: process_smp_lid mkey check failed - silently discarded
INTERNAL_ERR | 0x079233 | set_get_port_info: silently discarded.
BAD_PKT | 0x468496 | mad_ifc: ATTRV_SM_INFO handled by SW
BAD_PKT | 0x071808 | mad_ifc: smp trap repress silently discarded after processing.
@petro-rudenko
petro-rudenko / tf_lstm.py
Created October 5, 2016 08:20 — forked from siemanko/tf_lstm.py
Simple implementation of LSTM in Tensorflow in 50 lines (+ 130 lines of data generation and comments)
"""Short and sweet LSTM implementation in Tensorflow.
Motivation:
When Tensorflow was released, adding RNNs was a bit of a hack - it required
building separate graphs for every number of timesteps and was a bit obscure
to use. Since then TF devs added things like `dynamic_rnn`, `scan` and `map_fn`.
Currently the APIs are decent, but all the tutorials that I am aware of are not
making the best use of the new APIs.
Advantages of this implementation:
Loading
Sorry, something went wrong. Reload?
Sorry, we cannot display this file.
Sorry, this file is invalid so it cannot be displayed.
Loading
Sorry, something went wrong. Reload?
Sorry, we cannot display this file.
Sorry, this file is invalid so it cannot be displayed.
Loading
Sorry, something went wrong. Reload?
Sorry, we cannot display this file.
Sorry, this file is invalid so it cannot be displayed.
  1. General Background and Overview
"""
This is a batched LSTM forward and backward pass
"""
import numpy as np
import code
class LSTM:
@staticmethod
def init(input_size, hidden_size, fancy_forget_bias_init = 3):
// Adapted from Rob Norris' post at https://tpolecat.github.io/2014/04/11/scalac-flags.html
scalacOptions ++= Seq(
"-deprecation",
"-encoding", "UTF-8", // yes, this is 2 args
"-feature",
"-unchecked",
"-Xfatal-warnings",
"-Xlint",
"-Yno-adapted-args",