Toby DiPasquale codeslinger

## normcore-llm.md

      
              1 file
            
          
              207 forks
            
          
              38 comments
            
          
              2706 stars
            
          
                veekaybee
                / normcore-llm.md
            
            
              Last active
              April 23, 2024 16:03
            
              
                Normcore LLM Reads
              
          
    Anti-hype LLM reading list

Goals: Add links that are reasonable and good explanations of how stuff works. No hype and no vendor content if possible. Practical first-hand accounts of models in prod eagerly sought.
Foundational Concepts


Pre-Transformer Models


## macOS Internals.md

      
              1 file
            
          
              86 forks
            
          
              4 comments
            
          
              1588 stars
            
          
                kconner
                / macOS Internals.md
            
            
              Last active
              April 22, 2024 21:28
            
              
                macOS Internals
              
          
    macOS Internals

Understand your Mac and iPhone more deeply by tracing the evolution of Mac OS X from prelease to Swift. John Siracusa delivers the details.
Starting Points

How to use this gist

You've got two main options:

  
## jargon.md

      
              1 file
            
          
              28 forks
            
          
              9 comments
            
          
              179 stars
            
          
                cb372
                / jargon.md
            
            
              Last active
              May 8, 2023 16:03
            
              
                Category theory jargon cheat sheet
              
          
    Category theory jargon cheat sheet

A primer/refresher on the category theory concepts that most commonly crop up in conversations about Scala or FP. (Because it's embarassing when I forget this stuff!)
I'll be assuming Scalaz imports in code samples, and some of the code may be pseudo-Scala.
Functor

A functor is something that supports map.

  
## git-squash
#!/bin/sh

git rebase --interactive --autosquash \
    $(git merge-base $(git symbolic-ref --short HEAD) master)

## generate-pushid.js
/**
 * Fancy ID generator that creates 20-character string identifiers with the following properties:
 *
 * 1. They're based on timestamp so that they sort *after* any existing ids.
 * 2. They contain 72-bits of random data after the timestamp so that IDs won't collide with other clients' IDs.
 * 3. They sort *lexicographically* (so the timestamp is converted to characters that will sort properly).
 * 4. They're monotonically increasing.  Even if you generate more than one in the same timestamp, the
 *    latter ones will sort after the former ones.  We do this by using the previous random bits
 *    but "incrementing" them by 1 (only in the case of a timestamp collision).
 */

## bot-verbs.txt
Philosophical:
* writing lines of code is one of the least valuable things I can do
* cultivate your impatience
* reject the status quo, unless we can re-derive it from first-principles
* engineers imprint on the first languages (techniuqes, frameworks or technology) that we find success with (unconsciously seen as caregivers, which we defend w/o always knowing why)
  * we tend to overvalue the familiar/known; we tend to undervalue the unfamiliar/unknown, this interferes with our receptiveness to new ideas and personal growth
* we're 90% composed of bad habits; many of our best habits become bad as time passes; this allows us to filter for the fundamental; the great
* make doing the right thing easier than any other thing, or we will fail to achieve greatness, or break bad habits
* be conscious, be intentional
* "is this the highest we can aim?" (I prefer this over "is this the best we can do", the former is aspirational, the latter is judgemental)

## linkedout.js
// 1. Go to page https://www.linkedin.com/settings/email-frequency
// 2. You may need to login
// 3. Open JS console
//    ([How to?](http://webmasters.stackexchange.com/questions/8525/how-to-open-the-javascript-console-in-different-browsers))
// 4. Copy the following code in and execute
// 5. No more emails
//
// Bookmarklet version:
// http://chengyin.github.io/linkedin-unsubscribed/

## Encbox.md

      
              1 file
            
          
              39 forks
            
          
              17 comments
            
          
              387 stars
            
          
                Tho85
                / Encbox.md
            
            
              Last active
              February 4, 2023 18:30
            
          
    Build your own private, encrypted, open-source Dropbox-esque sync folder

Prerequisites:

One or more clients running a UNIX-like OS. Examples are given for Ubuntu 12.04 LTS, although all software components are available for other platforms as well (e.g. OS X). YMMV
A cheap Ubuntu 12.04 VPS with storage. I recommend Backupsy, they offer 250GB storage for $5/month. Ask Google for coupon codes.

Software components used:

Unison for file synchronization
EncFS for folder encryption


## progress.go
package main

import (
	"fmt"
	"io"
	"os"
	"time"

	"github.com/dustin/go-humanize"
)

## list.md

      
              1 file
            
          
              10 forks
            
          
              6 comments
            
          
              49 stars
            
          
                pbailis
                / list.md
            
            
              Last active
              April 15, 2018 08:54
            
              
                Quick and dirty (incomplete) list of interesting, mostly recent data warehousing/"big data" papers
              
          
    A friend asked me for a few pointers to interesting, mostly recent papers on data warehousing and "big data" database systems, with an eye towards real-world deployments. I figured I'd share the list. It's biased and rather incomplete but maybe of interest to someone. While many are obvious choices (I've omitted several, like MapReduce), I think there are a few underappreciated gems.
###Dataflow Engines:
Dryad--general-purpose distributed parallel dataflow engine

http://research.microsoft.com/en-us/projects/dryad/eurosys07.pdf
Spark--in memory dataflow

http://www.cs.berkeley.edu/~matei/papers/2012/nsdi_spark.pdf
	#!/bin/sh

	git rebase --interactive --autosquash \
	$(git merge-base $(git symbolic-ref --short HEAD) master)
	/**
	* Fancy ID generator that creates 20-character string identifiers with the following properties:
	*
	* 1. They're based on timestamp so that they sort after any existing ids.
	* 2. They contain 72-bits of random data after the timestamp so that IDs won't collide with other clients' IDs.
	* 3. They sort lexicographically (so the timestamp is converted to characters that will sort properly).
	* 4. They're monotonically increasing. Even if you generate more than one in the same timestamp, the
	* latter ones will sort after the former ones. We do this by using the previous random bits
	* but "incrementing" them by 1 (only in the case of a timestamp collision).
	*/
	Philosophical:
	* writing lines of code is one of the least valuable things I can do
	* cultivate your impatience
	* reject the status quo, unless we can re-derive it from first-principles
	* engineers imprint on the first languages (techniuqes, frameworks or technology) that we find success with (unconsciously seen as caregivers, which we defend w/o always knowing why)
	* we tend to overvalue the familiar/known; we tend to undervalue the unfamiliar/unknown, this interferes with our receptiveness to new ideas and personal growth
	* we're 90% composed of bad habits; many of our best habits become bad as time passes; this allows us to filter for the fundamental; the great
	* make doing the right thing easier than any other thing, or we will fail to achieve greatness, or break bad habits
	* be conscious, be intentional
	* "is this the highest we can aim?" (I prefer this over "is this the best we can do", the former is aspirational, the latter is judgemental)
	// 1. Go to page https://www.linkedin.com/settings/email-frequency
	// 2. You may need to login
	// 3. Open JS console
	// ([How to?](http://webmasters.stackexchange.com/questions/8525/how-to-open-the-javascript-console-in-different-browsers))
	// 4. Copy the following code in and execute
	// 5. No more emails
	//
	// Bookmarklet version:
	// http://chengyin.github.io/linkedin-unsubscribed/
	package main

	import (
	"fmt"
	"io"
	"os"
	"time"

	"github.com/dustin/go-humanize"
	)