yada trojblue

## comfyui_api_example.md

      
              1 file
            
          
              0 forks
            
          
              0 comments
            
          
              0 stars
            
          
                trojblue
                / comfyui_api_example.md
            
            
              Created
              June 30, 2024 19:37
            
          
enable "dev usage" in comfy settings to export api_workflow.json:


https://github.com/comfyanonymous/ComfyUI/blob/master/script_examples/basic_api_example.py


use the script:

import websocket #NOTE: websocket-client (https://github.com/websocket-client/websocket-client)
import uuid
import json

  
## pandas_memory_optimize.md

      
              1 file
            
          
              0 forks
            
          
              0 comments
            
          
              0 stars
            
          
                trojblue
                / pandas_memory_optimize.md
            
            
              Last active
              June 19, 2024 08:07
            
          
    用几种办法来减少dataframe占用的内存:

去掉信息重复的columns
提前去掉不需要的行
转换数字到最小精度(-50%)
转换Python string (objects)为pyarrow str (-30%)
转换date string为pd datetime (-85%)
转换大量重复出现的string为category (-95%)


## kedro_dynamic_pipeline.md

      
              1 file
            
          
              0 forks
            
          
              0 comments
            
          
              0 stars
            
          
                trojblue
                / kedro_dynamic_pipeline.md
            
            
              Last active
              June 12, 2024 23:44
            
          
    using functools.partial to pass in real arguments into kedro:
from functools import partial, update_wrapper
from kedro.pipeline import Pipeline, node

from .nodes import process_todo, DemoMerger


def create_wrapped_partial(func, *args, **kwargs):

  
## plot_quadratics.md

      
              1 file
            
          
              0 forks
            
          
              0 comments
            
          
              0 stars
            
          
                trojblue
                / plot_quadratics.md
            
            
              Created
              March 28, 2024 18:26
            
          
    import numpy as np
import matplotlib.pyplot as plt

def plot_quadratic_coefficients(coefficients):
    """
    Plots y = ax^2 + bx + c for each set of coefficients within specified x and y ranges.

    Parameters:
    - coefficients: dict, a dictionary of coefficient sets with 'a', 'b', and 'c' for each key.

  
## visualize_tag_counts.md

      
              1 file
            
          
              0 forks
            
          
              0 comments
            
          
              0 stars
            
          
                trojblue
                / visualize_tag_counts.md
            
            
              Last active
              March 12, 2024 20:41
            
          
    用来数出df里某列 tag counts数量, 然后可视化的代码:
def safe_split_tag_str(tag_str, separator=","):
    """
    Splits a tag string into a list of non-empty, whitespace-stripped tag strings.
    """
    if not tag_str:
        return []

  
## parquet_splitter_usage.md

      
              1 file
            
          
              0 forks
            
          
              0 comments
            
          
              0 stars
            
          
                trojblue
                / parquet_splitter_usage.md
            
            
              Last active
              March 10, 2024 14:54
            
          
    (pixiv-data-process/yada/13_pixiv_streamlined.ipynb)
输入一个(本地或者s3地址), 返回包含了所有文件的列表, 上传图片-meta的关系到s3:
(没那么多数据的时候可以直接这么用:)
# https://github.com/troph-team/build-it/blob/f996fe55a6fd2beda9e62a6624be0f0fe2a05848/buildit/sagemaker/parquet_splitter.py#L13
import os
from dataproc3.sagemaker import ParquetSplitter

  
## lambda_h100_setup.md

      
              1 file
            
          
              0 forks
            
          
              0 comments
            
          
              0 stars
            
          
                trojblue
                / lambda_h100_setup.md
            
            
              Created
              December 10, 2023 23:04
            
          
    nd setup, works on lambda h100 pcie:
conda:
cd ~/ && mkdir -p miniconda3 && wget https://repo.anaconda.com/miniconda/Miniconda3-py310_23.5.2-0-Linux-x86_64.sh -O ./miniconda3/miniconda.sh --no-check-certificate && bash ./miniconda3/miniconda.sh -b -u -p ./miniconda3 && rm ./miniconda3/miniconda.sh && ./miniconda3/bin/conda init bash && source ~/.bashrc  && python -m pip install unibox ipykernel jupyter poetry && python -m ipykernel install --user --name=conda310 

nd:


## extract_url_from_artstation_json.py
import json

# Function to extract handles from a given domain in a nested dictionary
def extract_handles(data, domain):
    def find_handles(d):
        handles = []
        for k, v in d.items():
            if isinstance(v, dict):
                handles.extend(find_handles(v))
            elif isinstance(v, list):

## cuda_11.8_installation_on_Ubuntu_22.04
#!/bin/bash

### steps ####
# verify the system has a cuda-capable gpu
# download and install the nvidia cuda toolkit and cudnn
# setup environmental variables
# verify the installation
# https://gist.github.com/MihailCosmin/affa6b1b71b43787e9228c25fe15aeba
###

## pytthon_debug.md

      
              1 file
            
          
              0 forks
            
          
              0 comments
            
          
              0 stars
            
          
                trojblue
                / pytthon_debug.md
            
            
              Created
              June 27, 2023 02:50
            
              
                common python debug commands
              
          
    save to txt:
my_list =  responses[56]

with open('my_file2.txt', 'w', encoding="utf-8") as f:
    for item in my_list:
        f.write("%s\n" % item)
save to clipboard:
	import json

	# Function to extract handles from a given domain in a nested dictionary
	def extract_handles(data, domain):
	def find_handles(d):
	handles = []
	for k, v in d.items():
	if isinstance(v, dict):
	handles.extend(find_handles(v))
	elif isinstance(v, list):
	#!/bin/bash

	### steps ####
	# verify the system has a cuda-capable gpu
	# download and install the nvidia cuda toolkit and cudnn
	# setup environmental variables
	# verify the installation
	# https://gist.github.com/MihailCosmin/affa6b1b71b43787e9228c25fe15aeba
	###