Skip to content

Instantly share code, notes, and snippets.

View huichen's full-sized avatar

Hui Chen huichen

  • DeepKnowledge AI
  • Hangzhou
View GitHub Profile
@huichen
huichen / classifier.go
Last active December 31, 2015 00:19
一百行实现一个分类器
package main
import (
"flag"
"github.com/huichen/mlf/contrib"
"github.com/huichen/mlf/eval"
"github.com/huichen/mlf/model"
"github.com/huichen/mlf/optimizer"
"log"
"os"
6KcRUyqipqP9bKUidj7rrB_CMhojopf88D7vveDvW3-OiVa6kvqyfQWUX6sqsFwJqii6YvVHbmHq3eOiVA2cPJLo0kLbEkJ76Tfwb7BU264F43iiMBxq_S2YBoBnGaxuo1yDnm42t9eYO_89JiajmKlo_Srkrykm1aB95-i8K55yOcoH2sAGC7T-nNyjjWuEBjq4XyDHGUL_v6Dd0eZuDK4FxTxonNWot-k6Ge1bX2fqL6Z4GftitSXd3KTo-TkRZrSja4D6FREwjZqqZZ3PVCNxYOiz2-1KK1qSkg4U4sAP8Opgfagj-kUSozxa3aSU3vZ923UxzNOVRTnTYdiVbuOAbYlQWBzy3xDMLpUCTFj1imofFKjFUQoLferxRWSF57LwbFyfHagNkBi0CKhr5123wec-LdlyeIEwQDqI3DVZL5Am3cmKXnflAKI4WSWiQAi5Ixy8TJwnjCgHjdBAFPX4ASM1yFMpY1JaxBHfe7lbsq8d8Lnp2GFn-Rv3G_fczY9yDjw6CaQmG9LH7EKDol-_qG5YyDaL82mJxX0yEixVSLcOG2QUbKCa6gZzBqUw-xY_Lld9K7_jrgCMrbKx594GJe2-u36OTSeQNoHvZIFYOlA11IAsepm8R9_dobjya2_tXJE9tXYJxBle8rz-WAIXBCGmZISVvl-WQshKNCuHV8AOi31OlHpyI161tlHD5CkPBuCXuvDroE_Qy0ttxE9wJg15uNFh8FsxzvIcDFOnwT2TLlTUfwOxlkmGO1Qpnffcu9THfkG5-gUR5csz4E36kQPP3gtXT45lRF3499c0r8EEC1OM1KTZO5EYkLvyE95mZPp7zAQrB3czqCCWWfk4sQU_Z8Xzv_GtnA1cso8HrrFymL99oa_eKlofHeyNRGYO5uGdLz86vSJhGfKh-JYM-iY8Zf3Y_wZlZqKegtLQikK9vVSJZdQy7lw3756MUlSZVEaXhUuY2SgdtF1hIsVCCqa8IDjQUtWq2eRXjseZkhyzytpe-BeiSGFS4TvY
@huichen
huichen / count_tablestore.go
Created September 7, 2019 15:12
采用多线程和采样算法较快地统计阿里云 tablestore 中满足某个条件的记录数
package main
// 假设你的 tablestore 中有两个主键,其中一个是另一个的 128bit md5 hash string
// 可以使用这个程序较快地统计满足某个条件的记录数
// 采用了多线程和采样,可以将统计速度提升 100 倍
// 我的经验,如果 sampleRation = 0.01 的情况下,假设表格中有 100 万行,统计一遍大约需要 10 秒
import (
"fmt"
"log"