-
Anscombe's quartet
-
A modern guide to getting started with Data Science and Python
- numpy -> pandas
- matplotlib -> seaborn
-
단어
- 확증 편향: 신념과 일치하는 정보만 받아들이는 경향
- 잘못된 인과관계의 오류: 단순 선후 관계 사건을 인과관계로 잘못 해석하는 것
-
ipython
- Reloading submodules in IPython
%load_ext autoreload %autoreload 2
-
python
- sort a list of dictionaries by value of the dictionary
from operator import itemgetter newlist = sorted(list_to_be_sorted, key=itemgetter('name'), reverse=True)
-
Data Mining
-
etc
- 기타
-
Mac OS에서 java경로 찾는 방법
/usr/libexec/java_home -v 1.7 # java 1.7 path /usr/libexec/java_home -v 1.6 # java 1.6 path
-
- 해시 값으로부터 원래의 입력값과의 관계를 찾기 어려운 성질을 가지는 해쉬함수
-
AOV = total sales revenue / total number of sales
-
Information Gain vs. Gain Ratio
-
Data Analysis
-
기타 링크
-
KDE - Kernel Density Estimation
-
One-hot encoding
-
- 유저에게 광고가 노출된 시간으로
- 누구에게 유리할것인가?
- 광고가 유저에게 오래동안 노출되면 annoying이 될것 같은데..
-
- 공유경제를 안좋게 보자면, 개인의 모든것을 상품화 시켜서 삶자체를 상품으로 만든다는 이야기.
- 남는 시간, 남는 공간, 남는 친절을 모두 상품화해서 판매하는 개인.
- 하지만 당분간 이쪽으로 새롭게 나오는 서비스는 계속해서 늘어날것 같다.
-
- 라인페이 (결제, 예정)
- 라인와우 (배달서비스, 예정)
- 라인택시 (콜택시, 예정)
- 라인맵스 (지도서비스, 예정)
- 라인앳 (상업용 계정, 예정)
- 라인 유료 공식계정 (연예인용 광고계정, 예정)
- 라인게임 (게임, 예정)
- 라인뮤직 (음원, 예정)
- 라인블로그 (블로그 플랫폼, 예정)
- 라인비즈니스커넥트 (비즈니스 메신저)
- 라인몰 (전자상거래)
- 라인스토어 (라인 관련 쇼핑몰)
- 라인데코 (스마트폰 꾸미기)
- 기타
-
시각화
-
CrossDomain
-
The Data Analytics - Free Book
- Communication skills are underrated If you can't present your analysis into digestible concepts for your CEO to understand, your analysis is only useful to yourself.
- The biggest challenge for a data analyst isn't modeling, it's cleaning and collecting Data analysts spend most of their time collecting and cleaning the data required for analysis. Answering questions like "where do you collect the data?", "how do you collect the data?", and "how should you clean the data?", require much more time than the actual analysis itself.
- A Data Scientist is better at statistics than a software engineer and better at software engineering than a statistician The greatest difference between a data scientist and a data analyst is the understanding of computer science and conducting analysis with data at scale. Data scientists only need a basic competency in statistics and computer science and not all are Ph.Ds. New tools are empowering more people to do data science.
- Do your own projects to break into the industry. The truth is, even in a quantitative major you are not taught what you need to know to work in data analytics. There is a learning gap between academia and industry that is best filled by doing projects. Find some sports statistics and do your own analysis. Learn R so that you can complete this analysis, not just to learn R itself. Also try Kaggle.
- Statistics > Programming. The development of tools and popularity of programmers has caused black box statistical analysis usage. Understanding selection bias vs. sampling bias and the underlying assumptions to which statistical functions are built on will make your opinions matter and your work invaluable.
- The most important skill is being able to ask the right questions. The power of data analytics is in taking open response questions and framing them to be multiple choice. Therefore if you have the ability to filter a million questions into options A through D, you are a data scientist for hire.
-
온라인 뉴스
-
Python
-
AngularJS
-
A/B Test, Multi-armed Bandit
-
Bayesian
-
Courses
-
Machine Learning 관련 Services
-
This is Digital Marketing: From Ad Men to Math Men | Mediative
-
커닝햄 법칙 (Cunningham's Law) The best way to get the right answer on the Internet is not to ask a question, it’s to post the wrong answer. 인터넷에서 정답을 알아내는 최고의 방법은, 질문을 하는 것이 아니라 잘못된 답을 올려놓는 것이다. http://meta.wikimedia.org/wiki/Cunningham's_Law 영원한 9월(Eternal September) “인터넷 개통 축하드립니다.”라는 의미로 쓰는 듯. http://en.wikipedia.org/wiki/Eternal_September 스트라이젠드 효과 (Streisand effect) 인터넷 확산을 막으려고 법적 대응을 하다가 그것 때문에 역으로 더 유명해지는 현상. http://en.wikipedia.org/wiki/Streisand_effect
- sed one line: http://sed.sourceforge.net/sed1line.txt
- awk one line: http://www.pement.org/awk/awk1line.txt