Skip to content

Instantly share code, notes, and snippets.

View hohoCode's full-sized avatar

hohoCode

  • University of Maryland College Park
View GitHub Profile
# coding=utf-8
# Copyright 2023 The HuggingFace Inc. team. All rights reserved.
#
# Licensed under the Apache License, Version 2.0 (the "License");
# you may not use this file except in compliance with the License.
# You may obtain a copy of the License at
#
# http://www.apache.org/licenses/LICENSE-2.0
#
# Unless required by applicable law or agreed to in writing, software
@hohoCode
hohoCode / list.md
Created November 10, 2020 17:38 — forked from ih2502mk/list.md
Quantopian Lectures Saved
''' Script for downloading all GLUE data.
Note: for legal reasons, we are unable to host MRPC.
You can either use the version hosted by the SentEval team, which is already tokenized,
or you can download the original data from (https://download.microsoft.com/download/D/4/6/D46FF87A-F6B9-4252-AA8B-3604ED519838/MSRParaphraseCorpus.msi) and extract the data from it manually.
For Windows users, you can run the .msi file. For Mac and Linux users, consider an external library such as 'cabextract' (see below for an example).
You should then rename and place specific files in a folder (see below for an example).
mkdir MRPC
cabextract MSRParaphraseCorpus.msi -d MRPC
@hohoCode
hohoCode / wavelet.c
Created October 14, 2012 06:31 — forked from Wollw/wavelet.c
Wavelet Tree
/*
* An implementation of the Wavelet Tree data structure.
* It is similar to a binary search tree using bits.
*
* More info here:
* http://siganakis.com/challenge-design-a-data-structure-thats-small
* http://www.alexbowe.com/wavelet-trees
*/
#include <stdio.h>
#include <stdlib.h>