Qiyuan Gong qiyuangong

nadavrot /
Last active Oct 18, 2019
Efficient matrix multiplication

High-Performance Matrix Multiplication

This is a short post that explains how to write a high-performance matrix multiplication program on modern processors. In this tutorial I will use a single core of the Skylake-client CPU with AVX2, but the principles in this post also apply to other processors with different instruction sets (such as AVX512).


Matrix multiplication is a mathematical operation that defines the product of


Build tensorflow on OSX with NVIDIA CUDA support (GPU acceleration)

These instructions are based on Mistobaan's gist, expanded and updated to work with the latest tensorflow OSX CUDA PR.


OS X 10.10 (Yosemite) or newer



### Features

A dynamic DNS (DDNS) update script based on DNSPod:


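### How it works

The script submits updates through the API provided by DNSPod. If the IP address has not changed, it does nothing; if the address has changed, it submits the new IP address.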
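
The compare-then-submit logic described above can be sketched as follows. This is only a minimal illustration, not the script's actual code: `sync_ip` and the `submit` callback are hypothetical names, and the callback stands in for whatever DNSPod API request the real script makes. The addresses come from a reserved documentation range.

```python
from typing import Callable, Optional


def sync_ip(current_ip: str,
            last_ip: Optional[str],
            submit: Callable[[str], None]) -> str:
    """Submit current_ip only if it differs from last_ip.

    Returns the IP address that is on record after the call.
    `submit` is a hypothetical stand-in for the real API request.
    """
    if current_ip == last_ip:
        # Address unchanged: skip the API call entirely.
        return last_ip
    # Address changed (or first run): push the new address.
    submit(current_ip)
    return current_ip


# Usage: simulate three polling rounds; the list records every submission.
submitted = []
last = None
for ip in ["203.0.113.5", "203.0.113.5", "203.0.113.9"]:
    last = sync_ip(ip, last, submitted.append)

print(submitted)  # ['203.0.113.5', '203.0.113.9']
```

Passing the submit step in as a callback keeps the change-detection logic independent of the network call, so it can be tried without credentials.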
