Zheng Yan zyan0

## System Design.md

      
vasanthk / System Design.md (last active May 6, 2024 20:21; 1 file, 2612 forks, 58 comments, 9404 stars)

System Design Cheatsheet


Picking the right architecture = Picking the right battles + Managing trade-offs

Basic Steps


Clarify and agree on the scope of the system


Use cases (description of sequences of events that, taken together, lead to a system doing something useful)

Who is going to use it?
How are they going to use it?


## lstm-lm.py
#!/usr/bin/env python
# -*- coding: utf-8 -*-

# This is a simplified implementation of the LSTM language model (by Graham Neubig)
#
#  LSTM Neural Networks for Language Modeling
#  Martin Sundermeyer, Ralf Schlüter, Hermann Ney
#  InterSpeech 2012
#
# The structure of the model is extremely simple. At every time step we

## dword2vec.c
//  Copyright 2013 Google Inc. All Rights Reserved.
//
//  Licensed under the Apache License, Version 2.0 (the "License");
//  you may not use this file except in compliance with the License.
//  You may obtain a copy of the License at
//
//      http://www.apache.org/licenses/LICENSE-2.0
//
//  Unless required by applicable law or agreed to in writing, software
//  distributed under the License is distributed on an "AS IS" BASIS,

## service-checklist.md

      
acolyer / service-checklist.md (last active January 30, 2024 17:39; 1 file, 185 forks, 25 comments, 715 stars)

Internet Scale Services Checklist

A checklist for designing and developing internet scale services, inspired by James Hamilton's 2007 paper "On Designing and Deploying Internet-Scale Services."

http://mvdirona.com/jrh/talksandpapers/jamesrh_lisa.pdf

Basic tenets


 Does the design expect failures to happen regularly and handle them gracefully?
 Have we kept things as simple as possible?


## k-mean on histogram.c
gsl_rng_env_setup();
gsl_rng* rng = gsl_rng_alloc(gsl_rng_default);
sqlite3* db = 0;
int h[0x10000];
int kc[0x100];
float kmean[0x100];
uint16_t tbl[0x10000];
int i;
for (i = 0; i < 0x10000; i++)
	tbl[i] = i;

## Install.md

      
ravidsrk / Install.md (last active September 30, 2021 15:46; 1 file, 10 forks, 2 comments, 14 stars)

Deploying django application with gunicorn nginx mysql

Step One: Update Packages

sudo apt-get update
sudo apt-get upgrade

Step Two: Install and Create Virtualenv

sudo apt-get install python-virtualenv
sudo virtualenv /opt/myenv

  
## naivebayes.rb
spam_train, ham_train, spam_test, ham_test = ['train/spam', 'train/ham', 'test/spam', 'test/ham'].map{|t| Dir["#{ARGV[0]}/#{t}/*"].map {|fn| File.open(fn, 'r:iso8859-1').read.gsub(/[^a-zA-Z]/, ' ').split}}
spam_log, ham_log = [spam_train, ham_train].map{|t| t.flatten.instance_eval {reduce(Hash.new(0)) { |h,v| h[v] += 1.0/size; h }.select{|w, v| w.size > 2 && v > 8e-6}}.instance_eval{each {|k,v| self[k] = Math.log(v)}}}
spam_predict, ham_predict = [spam_test, ham_test].map {|t| t.map{|d| [spam_log, ham_log].map {|log| d.reduce(0){|s, w| log[w] ? s + log[w] : s}}}}
p spam_predict.size, spam_predict.select{|e| e.first < e.last}.size
p ham_predict.size, ham_predict.select{|e| e.first > e.last}.size
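
The dense Ruby above can be hard to follow, so here is a rough Python sketch of the same idea: build a table of log relative word frequencies per class, then score a document by summing the log values of the words found in the table. The function names `train` and `score`, the toy documents, and the variable names are assumptions made for illustration; only the thresholds (word length greater than 2, frequency above 8e-6) are taken from the script.

```python
import math
from collections import Counter


def train(docs, min_len=3, min_freq=8e-6):
    """Map each kept word to the log of its relative frequency.

    docs is a list of token lists. Words shorter than min_len or rarer
    than min_freq are dropped, mirroring the filter in the Ruby script.
    """
    tokens = [w for doc in docs for w in doc]
    total = len(tokens)
    counts = Counter(tokens)
    return {
        w: math.log(c / total)
        for w, c in counts.items()
        if len(w) >= min_len and c / total > min_freq
    }


def score(doc, table):
    """Sum log frequencies of the words in doc that appear in table.

    Words missing from the table are skipped, as in the Ruby script.
    """
    return sum(table[w] for w in doc if w in table)


# Hypothetical toy data, not from the original gist.
spam_docs = [["cheap", "pills", "cheap"], ["win", "cheap", "prize"]]
ham_docs = [["meeting", "notes", "attached"], ["lunch", "meeting", "tomorrow"]]

spam_table = train(spam_docs)
ham_table = train(ham_docs)

doc = ["cheap", "prize"]
spam_score = score(doc, spam_table)
ham_score = score(doc, ham_table)
print(spam_score, ham_score)
```

Note that a word absent from a table contributes nothing to that class's score, so a class that has never seen the document's words ends up with a score of zero, which is higher than any sum of negative log values. Implementations that compare the two scores directly usually add some form of smoothing for unseen words; that is a general remark, not something the script above does.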

## nginx.conf
# 1. Make sure you have nginx sub module compiled in
# nginx -V  2>&1 | grep --color=always '\-\-with\-http_sub_module'

# 2. add two directives below at HTTP level

# nginx.conf
http {
        # ......

        sub_filter  '</head>' '<style type="text/css">html{ filter: progid:DXImageTransform.Microsoft.BasicImage(grayscale=1); -webkit-filter: grayscale(100%); filter: url("data:image/svg+xml;utf8,<svg xmlns=\'http://www.w3.org/2000/svg\'><filter id=\'grayscale\'><feColorMatrix type=\'matrix\' values=\'0.3333 0.3333 0.3333 0 0 0.3333 0.3333 0.3333 0 0 0.3333 0.3333 0.3333 0 0 0 0 0 1 0\'/></filter></svg>#grayscale"); /* Firefox 10+, Firefox on Android */

## BeyesianAvg.py
# -*- coding=utf-8 -*-
import collections

# Usage:
#   My approach is to write the output of WordsDetector.py to files,
#   then put the file names in the names list below and run this program.

names = ['name0',
         'name1',
         'name2',

## latency.markdown

      
hellerbarde / latency.markdown (created May 31, 2012 13:16, forked from jboner/latency.txt; 2 files, 742 forks, 50 comments, 4387 stars)

Latency numbers every programmer should know

L1 cache reference ......................... 0.5 ns
Branch mispredict ............................ 5 ns
L2 cache reference ........................... 7 ns
Mutex lock/unlock ........................... 25 ns
Main memory reference ...................... 100 ns             
Compress 1K bytes with Zippy ............. 3,000 ns  =   3 µs
Send 2K bytes over 1 Gbps network ....... 20,000 ns  =  20 µs
SSD random read ........................ 150,000 ns  = 150 µs

Read 1 MB sequentially from memory ..... 250,000 ns = 250 µs
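
To make the relative magnitudes easier to compare, the figures listed above can be loaded into a small Python table and divided against each other. This is only an illustrative sketch: the dictionary name `LATENCY_NS` and the helpers `ratio` and `format_ns` are made up here, and the numbers are copied from the list as given.

```python
# Latency figures from the list above, in nanoseconds.
LATENCY_NS = {
    "L1 cache reference": 0.5,
    "Branch mispredict": 5,
    "L2 cache reference": 7,
    "Mutex lock/unlock": 25,
    "Main memory reference": 100,
    "Compress 1K bytes with Zippy": 3_000,
    "Send 2K bytes over 1 Gbps network": 20_000,
    "SSD random read": 150_000,
    "Read 1 MB sequentially from memory": 250_000,
}


def ratio(slower, faster):
    """How many times longer `slower` takes than `faster`."""
    return LATENCY_NS[slower] / LATENCY_NS[faster]


def format_ns(ns):
    """Render nanoseconds as ns, or as µs once the value reaches 1,000 ns."""
    if ns >= 1_000:
        return f"{ns / 1_000:g} µs"
    return f"{ns:g} ns"


print(ratio("Main memory reference", "L1 cache reference"))  # 200.0
print(ratio("SSD random read", "Main memory reference"))     # 1500.0
print(format_ns(LATENCY_NS["Read 1 MB sequentially from memory"]))  # 250 µs
```

By these figures, a main memory reference costs about 200 L1 cache references, and one SSD random read costs about 1,500 main memory references.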