Skip to content

Instantly share code, notes, and snippets.

View brianspiering's full-sized avatar

Brian Spiering brianspiering

  • San Francisco, CA, USA
View GitHub Profile
@brianspiering
brianspiering / install_pyspark_on_m1_mac.md
Created February 24, 2022 20:58
Installation guide to pyspark on M1 Mac

Install Spark

Run all of these commands at the command line (not in a Jupyter Notebook). The command line will have more informative error messages and if we need complete additional steps, we'll get the messages.

Spark is a framework within the Scala programming language. Scala uses the JVM (Java Virtual Machine) so you'll need install Java.

If you use homebrew:

@brianspiering
brianspiering / python_nlp_packages.md
Last active April 8, 2024 20:48
A Hacker's Guide to Python string and Natural Language Processing (NLP) packages

A Hacker's Guide to Python string and Natural Language Processing (NLP) packages

Extraction

  • textract - Extract text from any document.
  • camelot - Extract text from PDF.

Preprocessing

@brianspiering
brianspiering / zvex_fuzz_factory_guitar_pedal_settings.md
Last active September 9, 2023 22:40
ZVex Fat Fuzz Factory Vexter guitar pedal settings

Stock Sounds

"It's a little stock"
— Lars Ulrich

Nickname Gate Comp Drive Stab Personal Comments
Mellow Mud 2:30 7:00 7:00 9:00 Wall of fuzz with a little bite.
Tone Bender Fuzz 2:00 8:30 9:00+ 9:15 Cleans up with guitar volume.
@brianspiering
brianspiering / debugging_advice_for_programming.md
Last active July 20, 2023 16:57
Debugging advice for programming

What to do when your code doesn't work

Follow these steps:

  1. Don't Panic! Relax and realize that you will solve this problem, even if it takes a little bit of messing around. Banging your head against the computer is part of your job (both as a student and as a professional programmer). Remember that the computer is doing precisely what you are telling it to do. There is no magic.

  2. Determine precisely what is going on. Did you get an error message from Python? If it is a SyntaxError, a helpful guide is here. Also, running your code in Python 3.11 or higher has improved error messages.

  3. Python error messages include a stack trace. There could be a number errors throughout the stack. You read a stack trace from bottom to top. Go slowly and understand each character and each line.

@brianspiering
brianspiering / .bash_profile
Last active June 16, 2023 18:08
This file holds all my BASH configurations and aliases
# ---------------------------------------------------------------------------
#
# Description: This file holds all my BASH configurations and aliases
#
# Sections:
# 1. Environment Configuration
# 2. Make Terminal Better (remapping defaults and adding functionality)
# 3. File and Folder Management
# 4. Searching
# 5. Process Management
{"the": 73088, "of": 27807, "to": 26508, "a": 23957, "and": 22283, "that": 18770, "in": 18187, "is": 11277, "for": 8947, "on": 8109, "with": 7253, "it": 7226, "as": 6476, "was": 5736, "but": 5488, "i": 5287, "by": 5238, "this": 5084, "be": 4795, "at": 4529, "have": 4453, "his": 4399, "he": 4352, "its": 4343, "are": 4291, "not": 4280, "an": 4175, "about": 4092, "has": 3767, "from": 3697, "you": 3416, "new": 3273, "one": 2986, "or": 2928, "who": 2912, "all": 2869, "times": 2787, "more": 2783, "will": 2777, "which": 2715, "nyt": 2690, "they": 2621, "if": 2473, "story": 2441, "would": 2436, "their": 2330, "out": 2312, "says": 2286, "what": 2282, "wp": 2131, "so": 2108, "clinton": 2047, "than": 2034, "been": 1956, "had": 1955, "were": 1923, "when": 1911, "up": 1902, "us": 1887, "also": 1866, "no": 1813, "post": 1807, "there": 1779, "her": 1744, "lat": 1735, "like": 1700, "some": 1601, "we": 1590, "other": 1578, "just": 1541, "do": 1508, "people": 1464, "because": 1447, "how": 1440, "now": 1429, "only": 1405, "repo
@brianspiering
brianspiering / week_4_prompts.md
Last active May 7, 2023 17:01
Week 4 prompts
  1. Review Prompts
    • Summarize in 2-3 setences traditional meaning representation (i.e., video 4.1-4.4).
    • Summarize in 2-3 setences Statistical Machine Translation (MT)
    • Why are Neural Machine Translation systems more popular right now than than traditional meaning representations?
    • Why are attention-based architectures the current SOTA (state of the art)?
  2. Discussion Prompts
    • If you know a langauage other than English:
      • What is the most difficult phrase or idea to translate into English?
      • How well Google Translate perform?
  • If there is a Google Translate error, what could be the cause of it?
@brianspiering
brianspiering / week_3_prompts.md
Last active May 4, 2023 23:23
Week 3 Prompts

Review Prompts

  1. Word Embeddings
  • Why are word embeddings useful?
  • What are the inputs?
  • What are the outputs?
  1. Recurrent Neural Network (RNN), Gated Recurrent Unit (GRU), Long Short Term Memory (LSTM)
@brianspiering
brianspiering / .zshrc
Last active May 2, 2023 17:24
My zsh profile
# ---------------------------------------------------------------------------
#
# Description: This file holds all my oh-my-zsh configurations and aliases
#
# locaction
# ~/.zshrc
# open with VSCode
# code ~/.zshrc
# ---------------------------------------------------------------------------