# Noteworthy CPU Features
SSE 128-bit vector instructions (e.g. addps http://www.felixcloutier.com/x86/ADDPS.html)
SSE4.1 Makes SSE good (blends, rounding, dot products, packed 32-bit integer multiply/min/max)
SSE4.2 Makes SSE do strings and CRC32
AVX Makes SSE 256-bit (e.g. vaddps)
AVX2 Adds features to AVX (256-bit integer operations, gathers, per-element variable shifts)
AVX-512 Makes SSE 512-bit
SHA Makes SSE do SHA1 and SHA256
FMA Fused multiply–add
CLMUL Carry-less multiplication https://goo.gl/ritKJX
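
To make the CLMUL entry concrete, here is a minimal pure-Python sketch of what carry-less multiplication computes. It does not use the CPU instruction; the function name `clmul` is made up for this illustration. The idea is ordinary shift-and-add multiplication with XOR in place of addition, so no carries propagate between bit positions.

```python
def clmul(a: int, b: int) -> int:
    """Carry-less product of two non-negative integers.

    Equivalent to multiplying two polynomials with coefficients in GF(2),
    where bit i of each operand is the coefficient of x**i.
    (Hypothetical helper for illustration, not a library function.)
    """
    if a < 0 or b < 0:
        raise ValueError("operands must be non-negative")
    result = 0
    while b:
        if b & 1:
            result ^= a  # XOR instead of ADD, so carries never propagate
        a <<= 1
        b >>= 1
    return result


# 0b11 * 0b11 is 9 with ordinary multiplication, but 0b101 without carries.
print(bin(clmul(0b11, 0b11)))  # 0b101
```

Hardware CLMUL does the same thing on 64-bit operands in a single instruction, which is why it is useful for CRC and GCM-style computations.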
@jart
jart / .bashrc.tensorboard
Last active April 16, 2018 10:21
Bash Python Profiling for TensorBoard
# -*-sh-*-
# pip install yappi
# tb-profile-wall-time --logdir=/tmp/mnist
tb-profile-wall-time() {
  -tensorboard-profile wall "$@"
}
tb-profile-cpu-time() {
  -tensorboard-profile cpu "$@"
}
@jart
jart / sip.rl
Created March 12, 2018 08:18
gossip sip parser definition
%%{# -*-ragel-*-
#
## SIP Message Parser Definition
#
# The parser should be a linear-time FSM. It should be able to parse a
# real-world SIP INVITE in less than 30μs. The only place this parser gives
# up speed is for friendly data structures, which are big structs linked together.
#
## Implementation Notes
#
@jart
jart / blakefiler.py
Last active September 18, 2023 16:22
Turns bazel query --output=build //tensorflow:libtensorflow_framework.so into isomorphic Makefile
# Copyright 2017 The TensorFlow Authors. All Rights Reserved.
#
# Licensed under the Apache License, Version 2.0 (the "License");
# you may not use this file except in compliance with the License.
# You may obtain a copy of the License at
#
# http://www.apache.org/licenses/LICENSE-2.0
#
# Unless required by applicable law or agreed to in writing, software
# distributed under the License is distributed on an "AS IS" BASIS,
@jart
jart / .nanorc
Last active January 17, 2018 03:50
set autoindent
set cut
set tabsize 2
set tabstospaces
set nohelp
set morespace
bind ^S whereis main
bind M-F nextword main
bind M-B prevword main
bind M-{ beginpara main
@jart
jart / build-tf-nightly.sh
Last active January 13, 2018 21:47
TensorFlow Nightly Ubuntu 14 Workaround Ops
#!/bin/bash
# TensorFlow Nightly Python 2.7 Slow CPU Package Builder on Ubuntu 14
#
# This script is a temporary hackish workaround until the TensorFlow team can
# set up a heterogeneous tooling environment that builds Python<=3.4 CPU pip
# packages using Ubuntu 14, due to challenging constraints imposed by vendors.
#
# It uploads poor quality builds of TensorFlow to GCS for testing environments:
#
# pip install --no-cache-dir -I https://tensorboard-builds.storage.googleapis.com/tf_nightly_cpu_slow_ubuntu14-123-cp27-cp27mu-linux_x86_64.whl
/********************************************************\
* *
* Ultra-hot screen management package *
* *
\********************************************************/
/***********************************************************-*****
@jart
jart / proxy.py
Created November 10, 2017 20:55
Authorizing HTTP Proxy
import BaseHTTPServer
import SocketServer
import base64
import httplib
import shutil
import urlparse
basic = lambda u,p: 'Basic %s' % base64.b64encode('%s:%s' % (u,p))
AUTHORIZATIONS = {
@jart
jart / .jartps1.sh
Last active November 29, 2023 00:27
PS1 with low latency git branch detection
#!/bin/bash
# jartps1.sh - Enhanced bash prompt
# Author: Justine Tunney <jart@google.com>
# Modified: 2023-11-28
#
# LIST OF BASH PROMPT ENHANCEMENTS
#
# - No latency issues when holding down enter key
# - Doesn't look unfamiliar compared to bash defaults
# - Show current git branch
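
One way to get low-latency branch detection is to read `.git/HEAD` directly instead of spawning a `git` process for every prompt. The Python sketch below illustrates that idea only; it is an assumption about the general technique, not a transcription of what jartps1.sh does, and the function name `git_branch` is hypothetical. It ignores worktrees and submodules, where `.git` is a file rather than a directory.

```python
import os
from typing import Optional


def git_branch(start: str) -> Optional[str]:
    """Return the current branch for `start` by reading .git/HEAD.

    Walks up from `start` until a .git/HEAD file is found. Returns None
    when no repository is found. (Hypothetical helper for illustration.)
    """
    path = os.path.abspath(start)
    while True:
        head = os.path.join(path, ".git", "HEAD")
        if os.path.isfile(head):
            with open(head, encoding="utf-8") as f:
                ref = f.read().strip()
            prefix = "ref: refs/heads/"
            if ref.startswith(prefix):
                return ref[len(prefix):]
            return ref[:7]  # detached HEAD: show an abbreviated commit hash
        parent = os.path.dirname(path)
        if parent == path:  # reached the filesystem root
            return None
        path = parent


if __name__ == "__main__":
    print(git_branch(os.getcwd()))
```

Because this only opens one small file per directory level, it avoids the fork and exec cost that makes a prompt feel sluggish when the enter key is held down.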
@jart
jart / WORKSPACE
Last active October 29, 2018 06:10
Google Auto and Dagger Bazel Config
load("@bazel_tools//tools/build_defs/repo:java.bzl", "java_import_external")
java_import_external(
name = "com_google_auto_common",
jar_sha256 = "eee75e0d1b1b8f31584dcbe25e7c30752545001b46673d007d468d75cf6b2c52",
jar_urls = [
"http://domain-registry-maven.storage.googleapis.com/repo1.maven.org/maven2/com/google/auto/auto-common/0.7/auto-common-0.7.jar",
"http://repo1.maven.org/maven2/com/google/auto/auto-common/0.7/auto-common-0.7.jar",
],
licenses = ["notice"], # Apache 2.0