Pratyay Banerjee Neilblaze

## embeddings.py

"""
a simple script that reads tweets inside a json file, uses openai to compute embeddings and creates two files, metadata.tsv and output.tsv, which cam be used to visualise the tweets and their embeddings in TensorFlow Projector (https://projector.tensorflow.org/)
"""

# obtain tweets.json from https://gist.github.com/gd3kr/948296cf675469f5028911f8eb276dbc

import pandas as pd
import json
from openai import OpenAI

## fast_conformer_nemo.ipynb

      
              1 file
            
          
              0 forks
            
          
              1 comment
            
          
              4 stars
            
          
                titu1994
                / fast_conformer_nemo.ipynb
            
            
              Last active
              May 22, 2024 20:22
            
          
        Loading

      Sorry, something went wrong. Reload?
      Sorry, we cannot display this file.
      Sorry, this file is invalid so it cannot be displayed.
      
          Viewer requires iframe.
      
    
## Apps.jsx
import styles from './Apps.module.scss';
import { useEffect, useState } from 'react';
import Link from 'next/link';

const APPS = [
	{
		title: 'APP',
		hero: 'Lorem ipsum dolor sit amet',
		description:
			'Lorem ipsum dolor sit amet, consectetur adipiscing elit, sed do.',

## normcore-llm.md

      
              1 file
            
          
              218 forks
            
          
              38 comments
            
          
              2781 stars
            
          
                veekaybee
                / normcore-llm.md
            
            
              Last active
              July 21, 2024 13:28
            
              
                Normcore LLM Reads
              
          
    Anti-hype LLM reading list

Goals: Add links that are reasonable and good explanations of how stuff works. No hype and no vendor content if possible. Practical first-hand accounts of models in prod eagerly sought.
Foundational Concepts


Pre-Transformer Models


## code-assist.md

      
              1 file
            
          
              3 forks
            
          
              5 comments
            
          
              32 stars
            
          
                Birch-san
                / code-assist.md
            
            
              Last active
              March 4, 2024 19:32
            
              
                Local VSCode AI code assistance via starcoder + 4-bit quantization in ~11GB VRAM
              
          
    Install HF Code Autocomplete VSCode plugin.
We are not going to set an API token. We are going to specify an API endpoint.

We will try to deploy that API ourselves, to use our own GPU to provide the code assistance.
We will use bigcode/starcoder, a 15.5B param model.

We will use NF4 4-bit quantization to fit this into 10787MiB VRAM.

It would require 23767MiB VRAM unquantized. (still fits on a 4090, which has 24564MiB)!
Setup API


## add-missing-import-extensions.txt
# Find
import\s(([^;]|\n)*)\sfrom\s(['"])(\.{1,2}\/.*)(?<!\.js)(?<!\.(css|pdf|png|jpg|jsx|mjs|mp3|mp4|svg|ttf))(?<!\.(avif|json|webm|webp|woff))(?<!\.woff2)(['"]);

# Replace with
import $1 from $3$4.js$7;

## gpt4_abbreviations.md

      
              1 file
            
          
              6 forks
            
          
              10 comments
            
          
              147 stars
            
          
                VictorTaelin
                / gpt4_abbreviations.md
            
            
              Last active
              June 18, 2024 15:03
            
              
                Notes on the GPT-4 abbreviations tweet
              
          
    Notes on this tweet.


The screenshots were taken on different sessions.


The entire sessions are included on the screenshots.


I lost the original prompts, so I had to reconstruct them, and still managed to reproduce.


The "compressed" version is actually longer! Emojis and abbreviations use more tokens than common words.


## README.md

      
              3 files
            
          
              3 forks
            
          
              0 comments
            
          
              6 stars
            
          
                veekaybee
                / README.md
            
            
              Last active
              June 27, 2024 19:54
            
              
                whisper.ipynb
              
          
    Using Whisper to transcribe audio

This episode of Recsperts was transcribed with Whisper from OpenAI, an open-source neural net trained on almost 700 hours of audio. The model includes an encoder-decoder architecture by tokenizing audio into 30-second chunks, normalizing audio samples to the log-Mel scale, and passing the data into an encoder. A decoder is trained to predict the captioned text matching the encoder, and the model includes transcription, as well as timestamp-aligned transcription, and multilingual translation.

The transcription process outputs a single string file, so it's up to the end-user to parse out individual speakers, or run the model [through a sec

  
## work-with-multiple-github-accounts.md

      
              1 file
            
          
              328 forks
            
          
              99 comments
            
          
              1074 stars
            
          
                rahularity
                / work-with-multiple-github-accounts.md
            
            
              Last active
              July 25, 2024 14:43
            
              
                How To Work With Multiple Github Accounts on your PC
              
          
    How To Work With Multiple Github Accounts on a single Machine

Let suppose I have two github accounts, https://github.com/rahul-office and https://github.com/rahul-personal. Now i want to setup my mac to easily talk to both the github accounts.

NOTE: This logic can be extended to more than two accounts also. :)

The setup can be done in 5 easy steps:
Steps:


Step 1 : Create SSH keys for all accounts
Step 2 : Add SSH keys to SSH Agent


## ipfs-server-setup.md

      
              1 file
            
          
              24 forks
            
          
              8 comments
            
          
              69 stars
            
          
                claus
                / ipfs-server-setup.md
            
            
              Last active
              May 9, 2023 03:51
            
              
                Host Your Site Under Your Domain on IPFS
              
          
    Host Your Site Under Your Domain on IPFS

This is a step-by-step tutorial for hosting your website under your domain on IPFS, from zero, on a DigitalOcean Ubuntu 16.04.3 x64 Droplet (i am using the $10 variant with 2GB RAM).
Install IPFS

Log in as root.
First, make sure the system is up to date, and install tar and wget:

	"""
	a simple script that reads tweets inside a json file, uses openai to compute embeddings and creates two files, metadata.tsv and output.tsv, which cam be used to visualise the tweets and their embeddings in TensorFlow Projector (https://projector.tensorflow.org/)
	"""

	# obtain tweets.json from https://gist.github.com/gd3kr/948296cf675469f5028911f8eb276dbc

	import pandas as pd
	import json
	from openai import OpenAI
	import styles from './Apps.module.scss';
	import { useEffect, useState } from 'react';
	import Link from 'next/link';

	const APPS = [
	{
	title: 'APP',
	hero: 'Lorem ipsum dolor sit amet',
	description:
	'Lorem ipsum dolor sit amet, consectetur adipiscing elit, sed do.',
	# Find
	import\s(([^;]\|\n))\sfrom\s(['"])(\.{1,2}\/.)(?<!\.js)(?<!\.(css\|pdf\|png\|jpg\|jsx\|mjs\|mp3\|mp4\|svg\|ttf))(?<!\.(avif\|json\|webm\|webp\|woff))(?<!\.woff2)(['"]);

	# Replace with
	import $1 from $3$4.js$7;