Zubin J valiantone

## finetuning-llms.md

      
              3 files
            
          
              0 forks
            
          
              0 comments
            
          
              0 stars
            
          
                valiantone
                / finetuning-llms.md
            
            
              Last active
              May 2, 2024 11:13
            
              
                Let's make some Wavess
              
          
    Wavess Guide to Fine-tuning LLMs

Wavess is a play to explore funding opportunities with Astrik in the marketing co-pilot AI B2B service space.
Ahead AI specializes in Machine Learning & AI research and is read by tens of thousands of researchers and practitioners who want to stay ahead in the ever-evolving field.
General Fine-tuning

Resources

Fine-tuning large language models (LLMs) in 2024
Fine-tuning open source large language models (LLMs)

  
## pandas_s3_streaming.py
def s3_to_pandas(client, bucket, key, header=None):

    # get key using boto3 client
    obj = client.get_object(Bucket=bucket, Key=key)
    gz = gzip.GzipFile(fileobj=obj['Body'])

    # load stream directly to DF
    return pd.read_csv(gz, header=header, dtype=str)

def s3_to_pandas_with_processing(client, bucket, key, header=None):

## window_functions.md

      
              1 file
            
          
              0 forks
            
          
              0 comments
            
          
              0 stars
            
          
                valiantone
                / window_functions.md
            
            
              Last active
              June 19, 2021 08:30
            
              
                SQL queries and examples
              
          
    Weekly, Monthly Active Emails

WITH
    -- this is your original query, with the ISO week and month number added.
    members_log_aggr(login_date,  year_nbr, iso_week_nbr, month_nbr, email_count) AS
    (
        SELECT
            CAST(ml.login AS Date),
            DATEPART(YEAR, ml.login),
            DATEPART(ISO_WEEK, ml.login),

  
## bellhops-archive.md

      
              4 files
            
          
              0 forks
            
          
              0 comments
            
          
              0 stars
            
          
                valiantone
                / bellhops-archive.md
            
            
              Last active
              April 9, 2024 10:28
            
              
                Work Hard, Play Harder
              
          
    Transitioning from Tag Clouds to Tag Trees

In the packages paradigm - each packaged selection/offering is a bundle of semi-unique characteristics - lets term these as attributes.
When a package bundle is selected it instantly informs our controller that certain atrributes are ground truth for this move. The current process introduces assumptions rather than ground truth. For instance let's observe Use Case 1.
Use Case 1: Studio Package Bundle

Facts:
    - Two Bellhops on the move
    - Duration of ~ 2hours
    - 16 Foot Moving truck required


## Cool-Retro-Term-Windows10.md

      
              1 file
            
          
              0 forks
            
          
              0 comments
            
          
              0 stars
            
          
                valiantone
                / Cool-Retro-Term-Windows10.md
            
            
              Created
              April 2, 2021 09:50
                — forked from h3r/Cool-Retro-Term-Windows10.md
            
          
    Installing Cool-Retro-Term on Windows10

First of all, this document is just a recompilation of different resources that already existed on the web previously that I personally tested some ones did work and other not. I liked the idea to make a full guide from start to end so all of you could also enjoy playing with cool-retro-term on windows 10.
Personally I installed it on a windows 10 pro version. Fingers crossed!


## ds-track-snippets.md

      
              4 files
            
          
              0 forks
            
          
              0 comments
            
          
              0 stars
            
          
                valiantone
                / ds-track-snippets.md
            
            
              Last active
              May 2, 2024 10:49
            
              
                Techlabs
              
          
    train = pd.DataFrame([
    {"Name": "Olyphant", "FamilySize": 1},
    {"Name": "Rodent", "FamilySize": 3},
    {"Name": "Possum", "FamilySize": 1},
])

sub = train[train["FamilySize"] == 1]
sub["isAlone"] = 1
train

  
## experimental-testing.md

      
              2 files
            
          
              0 forks
            
          
              0 comments
            
          
              0 stars
            
          
                valiantone
                / experimental-testing.md
            
            
              Last active
              March 13, 2024 09:39
            
              
                Life Loops
              
          
    Statistical Digressions


T-test, ANOVA or Regression, what's the difference?

Hypothesis-driven testing


A/B testing by Julian Shapiro
A/B Testing with Machine Learning - A Step-by-Step Tutorial
Beyond A/B Testing: Multi-armed Bandit Experiments

Handling Network Effects


How to Optimize your Switchback A/B Test Configuration
Design and Analysis of Switchback Experiments


## dev-tools.md

      
              3 files
            
          
              0 forks
            
          
              0 comments
            
          
              0 stars
            
          
                valiantone
                / dev-tools.md
            
            
              Last active
              April 9, 2024 10:10
            
              
                Onsight AnyWARE
              
          
    Developer Tools

osx setups


Mac Setup for Web Development [2022]
My dev setup 2022
macOS Setup after 15 Years of Linux
My 2021 New Mac Setup
Don't fall in love with your Mac—automate it!
macOS Setup Guide


## mongo_to_csv.py
# @Author: xiewenqian <int>
# @Date:   2016-11-28T20:35:09+08:00
# @Email:  wixb50@gmail.com
# @Last modified by:   int
# @Last modified time: 2016-12-01T19:32:48+08:00


import pandas as pd
from pymongo import MongoClient

## pandas_labeled_csv_import_to_mongo.py
import pandas as pd
from pymongo import MongoClient
import json

def mongoimport(csv_path, db_name, coll_name, db_url='localhost', db_port=27000)
    """ Imports a csv file at path csv_name to a mongo colection
    returns: count of the documants in the new collection
    """
    client = MongoClient(db_url, db_port)
    db = client[db_name]
	def s3_to_pandas(client, bucket, key, header=None):

	# get key using boto3 client
	obj = client.get_object(Bucket=bucket, Key=key)
	gz = gzip.GzipFile(fileobj=obj['Body'])

	# load stream directly to DF
	return pd.read_csv(gz, header=header, dtype=str)

	def s3_to_pandas_with_processing(client, bucket, key, header=None):
	# @Author: xiewenqian <int>
	# @Date: 2016-11-28T20:35:09+08:00
	# @Email: wixb50@gmail.com
	# @Last modified by: int
	# @Last modified time: 2016-12-01T19:32:48+08:00


	import pandas as pd
	from pymongo import MongoClient
	import pandas as pd
	from pymongo import MongoClient
	import json

	def mongoimport(csv_path, db_name, coll_name, db_url='localhost', db_port=27000)
	""" Imports a csv file at path csv_name to a mongo colection
	returns: count of the documants in the new collection
	"""
	client = MongoClient(db_url, db_port)
	db = client[db_name]