On The Journey to Neverland

Minesh A. Jethva minesh1291

minesh1291 /
Created May 4, 2024 11:55 — forked from ruvnet/
A PyTorch implementation of a Mixture of Experts (MoE) model resembling the Mixtral 8x7B architecture, with detailed inline comments. This model combines transformer layers with an MoE layer consisting of 8 experts, aiming for high efficiency by activating only 2 experts per token. It's configured with dimensions reflecting the operational effic…
This model integrates the MoE concept within a Transformer architecture. Each token's
representation is processed by a subset of experts, determined by the gating mechanism.
This architecture allows for efficient and specialized handling of different aspects of the
data, aiming for the adaptability and efficiency noted in the Mixtral 8x7B model's design
philosophy. The model activates only a fraction of the available experts for each token,
significantly reducing the computational resources needed compared to activating all experts
for all tokens.
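The routing idea described above can be sketched in a few lines of plain Python. This is a minimal illustration, not the gist's PyTorch implementation: the names `route_top_k` and `moe_forward`, the toy experts, and the gate logits are all hypothetical, and the sketch assumes the gate weights are obtained by a softmax over only the selected top-k logits.

```python
import math


def softmax(logits):
    """Numerically stable softmax over a list of floats."""
    peak = max(logits)
    exps = [math.exp(x - peak) for x in logits]
    total = sum(exps)
    return [e / total for e in exps]


def route_top_k(gate_logits, k=2):
    """Pick the k highest-scoring experts and renormalize their weights.

    Returns a list of (expert_index, weight) pairs whose weights sum to 1.
    """
    ranked = sorted(range(len(gate_logits)),
                    key=lambda i: gate_logits[i], reverse=True)
    top = ranked[:k]
    weights = softmax([gate_logits[i] for i in top])
    return list(zip(top, weights))


def moe_forward(token, experts, gate_logits, k=2):
    """Combine the outputs of only the selected experts for one token."""
    out = [0.0] * len(token)
    for index, weight in route_top_k(gate_logits, k):
        expert_out = experts[index](token)  # the other experts are never called
        out = [o + weight * v for o, v in zip(out, expert_out)]
    return out


# Hypothetical toy experts: expert i multiplies its input by (i + 1).
experts = [lambda x, s=i + 1: [s * v for v in x] for i in range(8)]
gate_logits = [0.1, 2.0, -1.0, 0.3, 1.0, 0.0, -0.5, 0.2]

routes = route_top_k(gate_logits)
output = moe_forward([1.0, -2.0], experts, gate_logits)
```

Only 2 of the 8 toy experts run for this token, which is where the compute saving comes from; in a real model the gate logits would come from a learned linear layer applied to the token representation.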
minesh1291 /
Created April 24, 2024 05:47 — forked from lostella/
Loading M5 competition data into a gluonts PandasDataset
# Works on gluonts dev branch as of May 30th, 2023
# Assumes "m5-forecasting-accuracy" folder with data next to the script
# Data is obtained from
import pandas as pd
from pathlib import Path
from gluonts.dataset.pandas import PandasDataset
# Load data from csv files
minesh1291 /
Created November 29, 2023 15:26 — forked from ruoyu0088/
a demo for zmq process with bokeh server
from os import path
from bokeh.models import Button, Div
from bokeh.layouts import column
from bokeh.document import without_document_lock
from bokeh.io import curdoc
from zmq_subprocess import ZmqSubProcessClient
ok_button = Button(label="ok")
div = Div()
minesh1291 /
Created October 29, 2023 02:46 — forked from denguir/
Installation procedure for CUDA & cuDNN

How to install CUDA & cuDNN on Ubuntu 22.04

Install NVIDIA drivers

Update & upgrade

sudo apt update && sudo apt upgrade

Remove previous NVIDIA installation

minesh1291 /
Created August 14, 2023 17:23 — forked from codeKgu/
Tutorial for multimodal_transformers
from transformers import Trainer, TrainingArguments
training_args = TrainingArguments(
minesh1291 /
Created August 14, 2023 17:22 — forked from codeKgu/
Tutorial for multimodal_transformers
from multimodal_transformers.model import AutoModelWithTabular, TabularConfig
from transformers import AutoConfig
num_labels = len(np.unique(torch_dataset.labels))
config = AutoConfig.from_pretrained('bert-base-uncased')
tabular_config = TabularConfig(
minesh1291 /
Created August 14, 2023 17:22 — forked from codeKgu/
Tutorial for multimodal_transformers
import pandas as pd
from multimodal_transformers.data import load_data
from transformers import AutoTokenizer
data_df = pd.read_csv('Womens Clothing E-Commerce Reviews.csv')
text_cols = ['Title', 'Review Text']
# The label col is expected to contain integers from 0 to N_classes - 1
label_col = 'Recommended IND'
categorical_cols = ['Clothing ID', 'Division Name', 'Department Name', 'Class Name']
numerical_cols = ['Rating', 'Age', 'Positive Feedback Count']
minesh1291 /
Created March 24, 2023 15:30 — forked from epignatelli/
pytorch Subset to return an instance of the parent Dataset, to be able to access the same attribute
An implementation of the pytorch Subset that returns an instance of the original dataset with a reduced number of items.
This has two benefits:
- You can still access the attributes of the Dataset class, such as its methods and properties.
- You can use the usual Python index notation with slices to chunk the dataset, rather than creating a list of indices.
class Dataset(object):
    def __init__(self, iterable):
        self.items = iterable
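The preview above is cut off, so the following is only a sketch of how the two stated benefits could be achieved, not the gist's actual code. The class names `SliceableDataset` and `Numbers` are hypothetical, and the sketch assumes the constructor takes nothing but the iterable of items.

```python
class SliceableDataset:
    """Hypothetical dataset whose slices are datasets of the same class."""

    def __init__(self, iterable):
        self.items = list(iterable)

    def __len__(self):
        return len(self.items)

    def __getitem__(self, index):
        if isinstance(index, slice):
            # Rebuild through type(self) so a subclass slice keeps its methods.
            return type(self)(self.items[index])
        return self.items[index]


class Numbers(SliceableDataset):
    """Hypothetical subclass with an extra method to show it survives slicing."""

    def total(self):
        return sum(self.items)


subset = Numbers(range(10))[2:5]
```

Here `subset` is itself a `Numbers` instance, so `subset.total()` works without building an index list. A dataset whose constructor needs extra arguments would have to copy those across when slicing.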
minesh1291 /
Created September 26, 2022 08:03 — forked from tangh/
Show annotations in COCO dataset (multi-polygon and RLE format annos).
from pycocotools.coco import COCO
import numpy as np
import cv2
import os
coco_dataset_path = "/export/public/MS-COCO-2017/"
coco = COCO(coco_dataset_path + "annotations/instances_val2017.json")
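For readers unfamiliar with the RLE annotations mentioned in the title, here is a small pure-Python sketch of how an uncompressed COCO RLE mask can be expanded. It is an illustration only and not part of the gist: `decode_uncompressed_rle` is a hypothetical helper, and it assumes the `counts` field is a list of integers (the compressed string form used by pycocotools is not handled).

```python
def decode_uncompressed_rle(counts, height, width):
    """Expand uncompressed COCO RLE counts into a 2D list of 0/1 values.

    The runs alternate between background (0) and foreground (1), starting
    with background, and pixels are laid out column by column.
    """
    flat = []
    value = 0
    for run in counts:
        flat.extend([value] * run)
        value = 1 - value  # switch between 0-runs and 1-runs
    if len(flat) != height * width:
        raise ValueError("run lengths do not match the mask size")
    # Convert from column-major order to a list of rows.
    return [[flat[col * height + row] for col in range(width)]
            for row in range(height)]


# Toy 2x3 mask: one background pixel, two foreground pixels, three background.
mask = decode_uncompressed_rle([1, 2, 3], height=2, width=3)
```

In practice the decoding is usually delegated to pycocotools; the sketch is only meant to show what the run lengths encode.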