Skip to content

Instantly share code, notes, and snippets.

View jessvb's full-sized avatar

Jessica Van Brummelen jessvb

View GitHub Profile
@jessvb
jessvb / .stats_in_python.md
Last active January 27, 2024 15:21
Notes for how to approach and do certain statistical tests. The particular details are in Python, but a lot of the notes are generalizable.

How to approach statistics

Before you code, do three things:

  1. Think about what you want to show in your data. For example, how would you like to ultimately visualize it or what would you ultimately like to say about it? Here are some helpful info graphics from a data science cheatsheet article on Medium:

image

image

  1. Do quick initial visualizations of your data to get a sense of their distributions, whether it's actually worth doing a statistical test (i.e., does it look like there might be a significant difference?), and whether there are any outliers (which could be interesting in themselves! E.g., why did that particular participant choose xyz vs. how the
@peci1
peci1 / Abbreviate Journal Names in Bibtex Database.py
Last active January 28, 2021 07:22 — forked from FilipDominec/Abbreviate Journal Names in Bibtex Database.py
Using the translation table from the Jabref program, finds and replaces all scientific journal names to their standardized abbreviated form. First argument is the file to be processed; outputs safely to 'abbreviated.bib'
#!/usr/bin/env python
#-*- coding: utf-8 -*-
import sys, os
import re
try: bibtexdb = open(sys.argv[1]).read()
except: print "Error: specify the file to be processed!"
if not os.path.isfile('journalList.txt'):