Skip to content

Instantly share code, notes, and snippets.

@mikekaminsky
Last active August 29, 2015 14:22
Show Gist options
  • Save mikekaminsky/63d1c782a797ca78a19f to your computer and use it in GitHub Desktop.
Save mikekaminsky/63d1c782a797ca78a19f to your computer and use it in GitHub Desktop.
Testing for Analytics and Datascience

Testing for Analytics and Datascience

Introduction

Historically software development for research and for the sciences has not been well tested. This has been the case for a variety of reasons, including:

  • Scientists and researchers are not developers, and simply don't know about best practices
  • Non-deterministic functions are difficult to test
  • Set-returning functions (most of data munging) are very time-consuming to test because you must generate a lot of test data
  • It's really difficult to think through all of the possible edge cases like you might try in TDD

Using and Testing Functions

Using and Testing Views

Other thoughts

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment