- 2011 - A trip through the Graphics Pipeline 2011
- 2015 - Life of a triangle - NVIDIA's logical pipeline
- 2015 - Render Hell 2.0
- 2016 - How bad are small triangles on GPU and why?
- 2017 - GPU Performance for Game Artists
- 2019 - Understanding the anatomy of GPUs using Pokémon
- 2020 - GPU ARCHITECTURE RESOURCES
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
from __future__ import annotations | |
from contextlib import contextmanager | |
from typing import NamedTuple, Callable, Optional, Any | |
import numpy as np | |
Array = Any | |
class Node(NamedTuple): | |
vjp: Optional[Callable] | |
parents: List[Node] |
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Shader "WorldNormalFromDepthTexture" | |
{ | |
Properties { | |
[KeywordEnum(3 Tap, 4 Tap, Improved, Accurate)] _ReconstructionMethod ("Normal Reconstruction Method", Float) = 0 | |
} | |
SubShader | |
{ | |
Tags { "RenderType"="Transparent" "Queue"="Transparent" } | |
LOD 100 |
This is a short post that explains how to write a high-performance matrix multiplication program on modern processors. In this tutorial I will use a single core of the Skylake-client CPU with AVX2, but the principles in this post also apply to other processors with different instruction sets (such as AVX512).
Matrix multiplication is a mathematical operation that defines the product of
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
import tensorflow as tf | |
# 1. Create and save two graphs | |
# c = a*b | |
g1 = tf.Graph() | |
with g1.as_default(): | |
a = tf.placeholder(tf.float32, name='a') | |
b = tf.Variable(initial_value=tf.truncated_normal((1,)), name='b') |
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
# Set-ExecutionPolicy Bypass -Scope CurrentUser # to further speedup powershell startup | |
$PROFILE_DIR = Split-Path -Path $PROFILE.CurrentUserAllHosts -Parent | |
$OLD = $PROFILE | |
$PROFILE = Join-Path $PROFILE_DIR "profile.ps1" | |
$PROFILE | Add-Member -NotePropertyName AllUsersAllHosts -NotePropertyValue $OLD.AllUsersAllHosts | |
$PROFILE | Add-Member -NotePropertyName AllUsersCurrentHost -NotePropertyValue $OLD.AllUsersCurrentHost | |
$PROFILE | Add-Member -NotePropertyName CurrentUserAllHosts -NotePropertyValue $PROFILE.ToString() | |
$PROFILE | Add-Member -NotePropertyName CurrentUserCurrentHost -NotePropertyValue $PROFILE.ToString() | |
Clear-Variable OLD |
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Original paper on A-normal form: | |
http://redlinernotes.com/docs/Classics/The%20Essence%20of%20Compiling%20with%20Continuations.pdf | |
A high-level intro to ANF: | |
http://matt.might.net/articles/a-normalization/ | |
One of the earlier attempts to relate SSA and CPS: |