// Configuration for a basic LSTM sentiment analysis classifier, using the binary Stanford Sentiment
// Treebank (Socher et al. 2013).
{
"dataset_reader": {
"type": "sst_tokens",
"use_subtrees": true,
"granularity": "2-class",
"tokenizer": {
"type": "spacy"
},
{
"dataset_reader": {
"type": "sst_tokens",
"token_indexers": {
"tokens": {
"type": "pretrained_transformer",
"model_name": "bert-base-uncased",
"namespace": "tokens"
}
},
@xinzhel
xinzhel / linux _environment_setup_on_windows.md
Last active May 30, 2022 08:04
Linux environment setup on Windows (WSL 2)
  1. Install WSL 2 (with a separate Linux kernel) and a Linux distribution
# run the following command in PowerShell
$ wsl --install -d Ubuntu-18.04

# Ubuntu-18.04 is my preferred Linux distribution. You can find all the supported distributions by running
# wsl --list --online
  • Caveat: you may not be able to launch WSL or Ubuntu until you restart your PC.
xinzhel / windows setup.md
Last active July 10, 2023 08:35
Windows Setup
wget https://repo.anaconda.com/miniconda/Miniconda3-py38_23.3.1-0-Linux-x86_64.sh
bash Miniconda3-py38_23.3.1-0-Linux-x86_64.sh

This installs Miniconda bundled with Python 3.8. Check links here for more versions.

xinzhel / learning-resource-for-ml-and-nlp-practitioner.md
Last active October 31, 2021 08:55
learning-resource-for-machine learning and natural language processing
xinzhel / reuters-json.py
Last active November 14, 2021 23:13
convert Reuters dataset on kaggle to json file
import logging
import os
import sys
import json
from typing import Dict, Optional
from tqdm.auto import tqdm
import timeit
import numpy as np
test = []
train = []
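The preview stops after the `train` and `test` lists are initialized. As a minimal sketch of the final step the gist description implies (writing each split to a JSON file), assuming each list ends up holding plain dictionaries. The `write_split` helper and the `text` / `topics` field names are hypothetical and do not come from the gist:

```python
import json
from pathlib import Path
from typing import Dict, List


def write_split(records: List[Dict], path: Path) -> None:
    """Serialize one dataset split as a single JSON array."""
    with path.open("w", encoding="utf-8") as f:
        # ensure_ascii=False keeps any non-ASCII characters readable in the file.
        json.dump(records, f, ensure_ascii=False, indent=2)


# Hypothetical records; the real field names depend on the Kaggle files.
train = [{"text": "Example training document.", "topics": ["grain"]}]
test = [{"text": "Example test document.", "topics": ["trade"]}]

# Usage (writes into the current directory):
# write_split(train, Path("train.json"))
# write_split(test, Path("test.json"))
```

Writing one array per split keeps each file loadable with a single `json.load` call; a line-delimited layout would be an equally reasonable choice for larger corpora.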
sudo apt update && sudo apt upgrade
# install pip, ipython3
sudo apt install python3-pip
# 1. install pyenv
curl -L https://github.com/pyenv/pyenv-installer/raw/master/bin/pyenv-installer | bash
#!/bin/bash
# Script to create Azure DSVM Spot instance with NVidia P100 GPU
read -p "Azure VM Name (default: dsvm): " vminput
vmname=${vminput:=dsvm}
while [ $password != $password2 ] ; do
read -s -p "Choose your Password: " password
echo
read -s -p "Re-enter Password: " password2
echo

Arxiv and PubMed

https://github.com/armancohan/long-summarization

CNN Daily Mail

$ wget https://storage.googleapis.com/allennlp-public-data/cnndm-combined-data-2020.07.13.tar.gz
$ tar -xzf cnndm-combined-data-2020.07.13.tar.gz
$ mv cnndm-combined-data-2020.07.13 cnn_dm
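
The unpack-and-rename steps above can also be sketched in Python using only the standard library, which may be handy inside a data-preparation script. The `extract_and_rename` helper is a hypothetical name introduced for illustration, and the sketch assumes the archive unpacks into a single top-level folder, as the `mv` command suggests:

```python
import shutil
import tarfile
from pathlib import Path


def extract_and_rename(archive: Path, extracted_name: str, target_name: str) -> Path:
    """Extract a .tar.gz archive next to itself, then rename the extracted folder."""
    workdir = archive.parent
    # Equivalent of `tar -xzf <archive>` run inside the archive's directory.
    with tarfile.open(archive, "r:gz") as tar:
        tar.extractall(workdir)
    # Equivalent of `mv <extracted_name> <target_name>`.
    target = workdir / target_name
    shutil.move(str(workdir / extracted_name), str(target))
    return target


# Usage (downloads a large file, so it is left commented out):
# import urllib.request
# name = "cnndm-combined-data-2020.07.13"
# url = f"https://storage.googleapis.com/allennlp-public-data/{name}.tar.gz"
# urllib.request.urlretrieve(url, f"{name}.tar.gz")
# extract_and_rename(Path(f"{name}.tar.gz"), name, "cnn_dm")
```

Only extract archives from sources you trust, since `extractall` writes whatever paths the archive contains.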
xinzhel / nlp_publications.md
Last active November 18, 2022 06:06
All you need for NLP Publications (Focusing on Conferences; Keep Updating...)