kshitizregmi

## pdf_compressor_ghostscript_without_loosing_data.bash
# for grayscale pdf
gs -sDEVICE=pdfwrite -dCompatibilityLevel=1.4 -sColorConversionStrategy=Gray -dProcessColorModel=/DeviceGray -dPDFSETTINGS=/ebook -dDownsampleColorImages=true -dColorImageResolution=144 -dDownsampleGrayImages=true -dGrayImageResolution=144 -dNOPAUSE -dQUIET -dBATCH -sOutputFile=cvttranscript.pdf transcripts.pdf


# for colored one
gs -sDEVICE=pdfwrite -dCompatibilityLevel=1.4 -dPDFSETTINGS=/ebook -dDownsampleColorImages=true -dColorImageResolution=144 -dDownsampleGrayImages=true -dGrayImageResolution=144 -dNOPAUSE -dQUIET -dBATCH -sOutputFile=converted.pdf original.pdf

## notebook_to_pdf_converter_python_script.txt
Install and Convert Jupyter Notebooks to PDF

This Gist provides a step-by-step guide to installing the necessary tools for converting Jupyter Notebooks (.ipynb) to PDF format using Python. It covers installing the required packages and tools (pip, nbconvert, notebook-as-pdf, pyppeteer, and playwright), and includes commands to convert notebooks either in bulk or individually to PDF format using the webpdf exporter.

Steps included:
1. Upgrade pip.
2. Install notebook-as-pdf and nbconvert.
3. Install pyppeteer and playwright for PDF rendering.
4. Convert all .ipynb files in a directory or a single notebook file to PDF.


## bq_schema_create_and_data_insert.py
from google.cloud import bigquery
import google.auth


class BigQuerySchema:
    def __init__(self):
        self.schema = []

    def add_fields(self, fields):
        for name, field_type in fields.items():

## sentance_transformer_tensorboard.py
import pandas as pd
import numpy as np
import plotly.express as px
from sklearn.decomposition import PCA
from sklearn.cluster import KMeans

# Load data
df = pd.read_pickle('all_user_embd-allmpnet.pickle')

# Feature engineering

## computeengine.md

      
              1 file
            
          
              0 forks
            
          
                0 comments
              
            
              0 stars
            
          
                kshitizregmi
                / computeengine.md
            
            
              Last active
              November 9, 2023 14:27
            
          
    How to Update disk size in compute engine

gcloud compute instances stop <your compute engine instance name>
gcloud compute disks resize <your compute engine name/disk name> --zone <your compute engine zone>  --size 30GB


## bq_read.py
import pandas as pd
from google.auth import default


# Get default credentials and project ID
credentials, project_id = default()


def read_bigquery_table(query, use_bq_storage_api=True):
    # Read a table from BigQuery using the BigQuery Storage API

## condaenv.md

      
              1 file
            
          
              0 forks
            
          
                0 comments
              
            
              0 stars
            
          
                kshitizregmi
                / condaenv.md
            
            
              Last active
              October 17, 2023 15:56
            
          
    Conda Environment
1. Create the Conda Environment:

Open a terminal or command prompt and run the following command to create a Conda environment with Python 3.11 (you can change the Python version to your preferred one):
conda create --name <your_env_name> python=3.11

  
## orphanbranch.md

      
              1 file
            
          
              0 forks
            
          
                0 comments
              
            
              0 stars
            
          
                kshitizregmi
                / orphanbranch.md
            
            
              Created
              October 17, 2023 15:50
            
          
    Create a New Branch: Use the following command to create a new branch. Replace  with the name you want for your new branch.
git checkout --orphan <branch-name>
This will create a new branch with no commit history or files from the previous branch.
Remove Existing Files: Although the branch is empty, there might be untracked files left from the previous branch. To remove all these untracked files, use the following commands

  
## migrate.sh
gsutil ls gs://gen-app-src/ | head -n 2000 | xargs -I '{}' gsutil cp '{}' gs://gen-app-dst/

## rowise_duplicate_compare.py
import numpy as np
a = df.iloc[:, 0].values
b = df.iloc[:, 1].values

# Find the indices of matching elements
matches = np.where(a == b)

# Compare the indices across the two arrays
for index in matches[0]:
    if a[index] == b[index]:
	# for grayscale pdf
	gs -sDEVICE=pdfwrite -dCompatibilityLevel=1.4 -sColorConversionStrategy=Gray -dProcessColorModel=/DeviceGray -dPDFSETTINGS=/ebook -dDownsampleColorImages=true -dColorImageResolution=144 -dDownsampleGrayImages=true -dGrayImageResolution=144 -dNOPAUSE -dQUIET -dBATCH -sOutputFile=cvttranscript.pdf transcripts.pdf


	# for colored one
	gs -sDEVICE=pdfwrite -dCompatibilityLevel=1.4 -dPDFSETTINGS=/ebook -dDownsampleColorImages=true -dColorImageResolution=144 -dDownsampleGrayImages=true -dGrayImageResolution=144 -dNOPAUSE -dQUIET -dBATCH -sOutputFile=converted.pdf original.pdf
	Install and Convert Jupyter Notebooks to PDF

	This Gist provides a step-by-step guide to installing the necessary tools for converting Jupyter Notebooks (.ipynb) to PDF format using Python. It covers installing the required packages and tools (pip, nbconvert, notebook-as-pdf, pyppeteer, and playwright), and includes commands to convert notebooks either in bulk or individually to PDF format using the webpdf exporter.

	Steps included:
	1. Upgrade pip.
	2. Install notebook-as-pdf and nbconvert.
	3. Install pyppeteer and playwright for PDF rendering.
	4. Convert all .ipynb files in a directory or a single notebook file to PDF.
	from google.cloud import bigquery
	import google.auth


	class BigQuerySchema:
	def __init__(self):
	self.schema = []

	def add_fields(self, fields):
	for name, field_type in fields.items():
	import pandas as pd
	import numpy as np
	import plotly.express as px
	from sklearn.decomposition import PCA
	from sklearn.cluster import KMeans

	# Load data
	df = pd.read_pickle('all_user_embd-allmpnet.pickle')

	# Feature engineering
	import pandas as pd
	from google.auth import default


	# Get default credentials and project ID
	credentials, project_id = default()


	def read_bigquery_table(query, use_bq_storage_api=True):
	# Read a table from BigQuery using the BigQuery Storage API
	import numpy as np
	a = df.iloc[:, 0].values
	b = df.iloc[:, 1].values

	# Find the indices of matching elements
	matches = np.where(a == b)

	# Compare the indices across the two arrays
	for index in matches[0]:
	if a[index] == b[index]: