Vicki Boykis veekaybee

## searchrecs.md

      
veekaybee / searchrecs.md (1 file, 1 fork, 0 comments, 21 stars)
Last active January 22, 2024 13:53

Understanding search and recommendations

How are search and recommendations the same, and how are they different?


Question on Mastodon
Question on Twitter

TL;DR:

- The design of both search and recommendations is to find and filter information
- Search is a "recommendation with a null query"
- Search is "I want this", recommendations is "you might like this"


## chatgpt.md

      
veekaybee / chatgpt.md (1 file, 38 forks, 4 comments, 337 stars)
Last active April 12, 2024 20:16

Everything I understand about chatgpt

ChatGPT Resources

Context

ChatGPT appeared like an explosion on all my social media timelines in early December 2022. While I keep up with machine learning as an industry, I wasn't focused so much on this particular corner, and all the screenshots seemed like they came out of nowhere. What was this model? How did the chat prompting work? What was the context of OpenAI doing this work and collecting my prompts for training data?
I decided to do a quick investigation. Here's all the information I've found so far. I'm aggregating and synthesizing it as I go, so it's currently changing pretty frequently.
Model Architecture


## pyscript.html
<!DOCTYPE html>
<html lang="en">

<head>
    <meta charset="utf-8">
    <meta name="viewport" content="width=device-width, initial-scale=1">
    <title>Some plotting</title>
    <link rel="stylesheet" href="https://pyscript.net/alpha/pyscript.css" />
    <script defer src="https://pyscript.net/alpha/pyscript.js"></script>
    <py-env>

## readme.md

      
veekaybee / readme.md (2 files, 0 forks, 0 comments, 0 stars)
Created October 30, 2022 15:20

To run:
dot -Tpng trie.dot -o trie.png

  
## simpleScaldingJob.sc
import com.twitter.scalding._

class WordCountJob(args: Args) extends Job(args) {

val lines = TypedPipe.from(TextLine("posts.txt"))

lines.flatMap { line => tokenize(line) }
    .groupBy { word => word }
    .size
    .groupAll
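
For readers less familiar with Scalding, the shape of the pipeline above (split each line into words, group identical words, count each group) can be sketched in plain Python. This is an analogue, not the Scalding job itself. The `tokenize` helper is not shown in the preview, so the version here (lowercase, split on non-word characters) is an assumption, and `word_count` is a hypothetical name.

```python
import re
from collections import Counter
from typing import Iterable


def tokenize(line: str) -> list[str]:
    # Assumed tokenizer: lowercase, then split on runs of non-word characters.
    return [w for w in re.split(r"\W+", line.lower()) if w]


def word_count(lines: Iterable[str]) -> Counter:
    # Plays the role of flatMap + groupBy(word) + size, using one Counter.
    counts: Counter = Counter()
    for line in lines:
        counts.update(tokenize(line))
    return counts


if __name__ == "__main__":
    # Hypothetical sample input standing in for posts.txt.
    sample = ["the cat sat", "The cat ran."]
    for word, n in word_count(sample).items():
        print(word, n)
```

Unlike Scalding, this runs in a single process and holds all counts in memory, so it only illustrates the logic, not the distributed execution.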

## concatentateFiles.sc
"com.lihaoyi" %% "os-lib" % "0.7.8"

// Clone my static site repo, loop through posts and get all files as a single file

val wd = os.pwd / "_posts"
val sd = os.Path("/Users/vicki/IdeaProjects/scalding/scalding-repl")

// Concatenates all the files
os.write.over(
  wd / "posts.md",

## distance.md

      
veekaybee / distance.md (1 file, 0 forks, 0 comments, 0 stars)
Last active December 30, 2021 15:41

Different Distance Measures

Jaccard Similarity

import numpy as numpy
import typing
 
a = [1,2,3,4,5,11,12]
b = [2,3,4,5,6,8,9]

cats = ["calico", "tabby", "tom"]
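
The preview stops after the sample data. Jaccard similarity is the size of the intersection of two sets divided by the size of their union, and a minimal sketch with built-in sets, applied to the `a` and `b` lists above, might look like this. The function name `jaccard_similarity` is hypothetical, and returning 1.0 for two empty inputs is an assumed convention rather than anything the gist states.

```python
from typing import Hashable, Iterable


def jaccard_similarity(x: Iterable[Hashable], y: Iterable[Hashable]) -> float:
    set_x, set_y = set(x), set(y)
    union = set_x | set_y
    if not union:
        # Assumed convention: two empty sets are treated as identical.
        return 1.0
    return len(set_x & set_y) / len(union)


a = [1, 2, 3, 4, 5, 11, 12]
b = [2, 3, 4, 5, 6, 8, 9]

# Intersection {2, 3, 4, 5} has 4 items; the union has 10.
print(jaccard_similarity(a, b))  # 0.4
```

Because it relies only on set membership, the same function also works on string lists such as `cats`.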

  
## tenderness.md

      
veekaybee / tenderness.md (1 file, 0 forks, 0 comments, 1 star)
Last active April 12, 2019 14:50

My translated lyrics for Нежность, Tenderness.
Sung by Maya Kristalinskaya
Context
Translation:
Without you, the earth became empty.

  
## keybase.md

      
veekaybee / keybase.md (1 file, 0 forks, 0 comments, 0 stars)
Created September 20, 2017 15:43

Keybase proof

I hereby claim:

I am veekaybee on github.
I am veekaybee (https://keybase.io/veekaybee) on keybase.
I have a public key ASC1BmRUMCaXHMnJ2DzEnxIyypbZqJmYGJIbCxhhrrSZKgo

To claim this, I am signing this object:

  
## wholesome-data-science.md

      
veekaybee / wholesome-data-science.md (1 file, 0 forks, 1 comment, 5 stars)
Last active August 16, 2019 06:40

Wholesome data science.

Wholesome Data Science

Data science has had a really bad reputation recently. Between Facebook's privacy violations, facial scanning at kiosks in restaurants, and racism in algorithms, there are a lot of cases where surveillance, invasion of privacy, and unethical algorithms are dominating the news.
These cases are really important to make public, study, and prevent. But it's just as important to collect examples of good use cases of data science (that are not hyperbolized or PR fluff) so we can focus on those as an industry, and learn about what makes them work, as well.
Have some? Make some? Feel free to leave a comment or edit.
Examples