Luis Tung luistung

  • Hangzhou, China
luistung / fmt-quickscript.rkt
Last active September 21, 2024 17:46 — forked from Metaxal/fmt-quickscript.rkt
Quickscript for sorawee's `fmt`
#lang racket/base
(require quickscript
fmt ; needs to be installed first
;;; Author: Laurent Orseau
;;; License: Apache License, Version 2.0 or
;;; MIT license at your option.
luistung /
Last active April 28, 2024 09:53
finetune llm example
from transformers import AutoTokenizer
from datasets import Dataset
import torch
import pandas as pd
import numpy as np
tokenizer = AutoTokenizer.from_pretrained("gpt2")
tokenizer.pad_token = tokenizer.eos_token # use the EOS token as the padding token
qa_pairs = [
luistung /
Created April 28, 2024 09:45
continue pretrain example using hugging face
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer, Trainer, TrainingArguments
from datasets import Dataset
# Choose the model; this can be replaced with any model supported by transformers, such as "bert-base-uncased" or "gpt2"
model_name = "gpt2"
device = torch.device("cpu")
# Load the model and tokenizer
model = AutoModelForCausalLM.from_pretrained(model_name).to(device)

Keybase proof

I hereby claim:

  • I am luistung on github.
  • I am luistung on keybase.
  • I have a public key ASCcsP70NQH1pe_YHkb_VWNteyojKMnNa4gSEpNRxAF3_Qo

To claim this, I am signing this object:

luistung / CMakeLists.txt
Created June 11, 2021 02:58
pytorch to c++
cmake_minimum_required(VERSION 3.0 FATAL_ERROR)
find_package(Torch REQUIRED)
add_executable(example-app example-app.cpp)
target_link_libraries(example-app "${TORCH_LIBRARIES}")
set_property(TARGET example-app PROPERTY CXX_STANDARD 14)
luistung / tokenization.cpp
Last active May 30, 2024 03:15
c++ version of bert tokenize
/* c++ version of tokenization for bert
Copyright (C) 2019 luistung
This program is free software: you can redistribute it and/or modify
it under the terms of the GNU General Public License as published by
the Free Software Foundation, either version 3 of the License, or
(at your option) any later version.
This program is distributed in the hope that it will be useful,
but WITHOUT ANY WARRANTY; without even the implied warranty of
#include <iostream>
#include <fstream>
#include <string>
#include <vector>
#include <unordered_map>
#include <boost/algorithm/string.hpp>
#include <utf8proc.h>
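
For orientation, BERT's tokenizer splits each word into subword pieces with a greedy longest-match-first search over a vocabulary (WordPiece). The sketch below illustrates that step in Python. It is not the code from this gist: the function name `wordpiece_tokenize`, the `max_chars` limit, and the toy vocabulary are assumptions made for illustration.

```python
def wordpiece_tokenize(word, vocab, unk_token="[UNK]", max_chars=100):
    """Greedy longest-match-first split of one word into vocabulary pieces."""
    if len(word) > max_chars:
        return [unk_token]
    pieces = []
    start = 0
    while start < len(word):
        end = len(word)
        match = None
        # Shrink the candidate from the right until it is in the vocabulary.
        while start < end:
            piece = word[start:end]
            if start > 0:
                piece = "##" + piece  # continuation pieces carry a "##" prefix
            if piece in vocab:
                match = piece
                break
            end -= 1
        if match is None:
            return [unk_token]  # no piece fits, so the whole word is unknown
        pieces.append(match)
        start = end
    return pieces


# Hypothetical toy vocabulary, for illustration only.
toy_vocab = {"un", "##aff", "##able", "play", "##ing"}
print(wordpiece_tokenize("unaffable", toy_vocab))
```

A real tokenizer would first normalize the text and split it on whitespace and punctuation, then apply this step to each word.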
luistung / .vimrc
Last active April 24, 2019 03:31
vim config
syntax on
colorscheme desert
set number
set cindent
set tabstop=4
set shiftwidth=4
set expandtab
set termencoding=utf-8
set fileencodings=utf-8
set encoding=utf-8
luistung /
Created July 29, 2014 08:54
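from BaseHTTPServer import HTTPServer, BaseHTTPRequestHandler  # Python 2 module
class TestHTTPHandler(BaseHTTPRequestHandler):
    def do_GET(self):
        self.protocol_version = 'HTTP/1.1'
        self.send_response(200)  # the status line must precede any header
        self.send_header("Welcome", "Contect")
        self.end_headers()  # blank line separating headers from the body
        self.wfile.write('hello world')
http_server = HTTPServer(('', 7778), TestHTTPHandler)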
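
The `BaseHTTPServer` module was renamed to `http.server` in Python 3. A minimal Python 3 sketch of the same handler might look like the following. The `make_server` helper and the `Content-Length` header are additions that are not in the original gist, and `protocol_version` is set as a class attribute so that it is in effect when the request is parsed.

```python
from http.server import BaseHTTPRequestHandler, HTTPServer


class TestHTTPHandler(BaseHTTPRequestHandler):
    protocol_version = "HTTP/1.1"  # class attribute, read while parsing the request

    def do_GET(self):
        body = b"hello world"  # wfile expects bytes in Python 3
        self.send_response(200)
        self.send_header("Welcome", "Contect")
        # HTTP/1.1 keeps connections open, so the client needs the body length.
        self.send_header("Content-Length", str(len(body)))
        self.end_headers()
        self.wfile.write(body)


def make_server(port=7778):
    """Hypothetical helper: build the server without starting it."""
    return HTTPServer(("", port), TestHTTPHandler)
```

Calling `make_server().serve_forever()` would then serve requests on port 7778 until interrupted.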
luistung / SConstruct
Last active August 29, 2015 14:03
env = Environment(CPPFLAGS=["-std=gnu++0x", "-Wall", "-O0"])
env.SharedLibrary('foo', ['f1.cpp'])
env.Program("test.cpp", LIBS=["boost_regex", "boost_locale", "foo"], LIBPATH='.')