lrvick/security.md

## security.md

      
    Raw
  

              security.md
            
          
    Security Policy

At #! we have some clear preferences when it comes to managing sevices to
maximize flexibility, reliability, and security.
This document aims to be an explicit policy of things to do and not do as well
as the rationale for these rules so as to inform decisions not yet explicitly
covered by this document.
Principles

Principle of Least Privilege

Do not give any service or system more rights and resources required to carry
out its well defined legitimate purpose. This is of course for security but
also for reliability.
Examples

A credential manager


Should encrypt all assets at rest
Should only decrypt the specific assets needed when they are needed
Should give out limited credentials to requesting services

Vault for instance can issue one-time-use mongodb credentials to services
when they are starting up.


A recurring job that compiles static assets


Should only need write access to one directory

Use filesystem namespacing, and only mount the specific path needed


Should only need very minimal resources

Use cgroups to set worst-case upper bounds for cpu/threads/memory, etc.


Should -not- need to see other processes on the system

Use process namespacing


Should only have access to minimal filesystem write system calls

An ssh bastion host


Does not need a package manager or many packages

Use packer or similar to build a bare minimum image.


Does not need write access to the disk

Boot with fsprotect or similar


Does not need to offer shell accees

/bin/true is all that should be required for ssh passthrough


Resources


Principle Of Least Privilege - Wikipedia
Containers vs Zones vs Jails vs VMs - Jessie Frazelle
How to use Docker seccomp profiles
Bane: AppArmor profile generator for Docker
Isolate containers with a user namespace - Docker Docs
Role Based Access Control in K8s
Security: OpenBSD vs FreeBSD
Managing compute resources for Containers
Package: fsprotect

Fault Tolerance

It should be possible to destroy any single server at any time with minimal
user facing disruption. The Netflix idea of "monkey testing" should be
something we strive to replicate.
Examples

Amazon terminates our bastion host without warning


Should be contained in an autoscale group on a fixed template.
Should fail TCP:22 health checks so an identical server will be started
Will automatically update DNS, load balancer, or recycle an IP.

A bug causes widespread MongoDB database corruption


We should have a continued streaming backup of our oplog
We should be able to detach all mongo instances to trigger replacements
We should be able to oplogReplay to restore to last known-good timestamp

Resources


The Netflix Simian Army
Chaos Monkey
Pumba: Chaos testing and network latency generator for Docker
Incremental backup/restore via Oplog Replay in MongoDB

Kerckhoffs's Pinciple

"A cryptosystem should be secure even if everything about the system, except
the key, is public knowledge."
Assume our adversaries are patient and have all of our documentation,
including this document.
Design everything so that if sales chooses to use it as a point of pride to
when talking to clients, this can't hurt us.
Also when designing things, do it with the same level of care and abstraction
you would if we were making it open source, even if we are not.
Examples

Our adversary has all of our documentation


They should learn they can't modify our systems without 2 yubikeys with pins.
They should learn our barrier is very high, and move on to easier targets.

Our adversary gains read/write access to our database


All private keys are encrypted with key only held by HSM
HSM should only decrypt private keys inside itself, without revealing them.
Tampering with a transaction will be rejected by HSM due to a bad signature

Resources


Kerkhoffs's Pinciple - Wikipedia
A note about Kerkhoffs's Principle

Distrust, then verify.

Assume any system or software we use can be compromised at any time. Assume our
attackers have 0-days they are willing to burn on us.
Security is hard enough with open source code where anyone can audit it. A
closed solution should always be highly distrusted, and given the minimal
amount of access to allow the business to function.
If there is an open source solution with wide community auditing, we should
always favor that as time allows.
We must always maintain ways to verify the integrity of systems at any time,
especially those that grant untrusted third parties access to our systems, such
as a janitor at a datacenter.
Examples

We don't have time to run our own VCS in the short term


Third party systems that allow us to -prove- their integrity are preferred
Use Github, but sign all commits and verify externally before builds.

We recently patched a remote code execution vulnerability


Rebuild every system impacted. This easily proves there are no backdoors

Resources


Open Source Security - RedHat
Rootkit / Evil Maid Attack
Keystone Metrics in DevOps: The 30 Day project

Avoid Lock-In

As a organization with limited resources, we must sometimes employ the help of
third party services in the short term.
This should always be treated as technical debt, and we should do so with a
clear plan of escape when said vendor has an unacceptable change in their
availability, rules, or pricing.
Examples

We need Redis nodes but don't have the resources to self-host today


AWS Provides Elasticache which is turn-key vanilla redis
We can easily port the data to our own servers at any time if forced

We need a system to manage credentials/rights for servers


IAM and KMS are powerful, but not portable.
Favor rights management via pub/sub and open tools like Vault where possible

We need to reliably scale custom applications


Favor cloud-agnostic deployment tools like Helm, Docker, and Kubernetes.
Ensure applications have no awareness of what cloud provider they run in

Resources


Vault - Infrastructure secret/key management
Kubernetes - Container orchestration
Helm - Kubernetes package manager
Docker/Kubernetes Hello world

Recommendations

The following will detail specific implementations of the above principles as
recommended by the #! team.
Vendors

The following vendors have security practices we consider to be equal to or
greater than ours and are generally trusted.
Acceptance


We must be able to cryptographically prove all changes and reviews made by
All code must be public and open source
Must be active in OSS community and transparent about security practices.
Authors individually sign code and binaries
All code is reviewed by at least one engineer other than author.
Package management system automatically verifies signatures/hashes
Must do reproducible builds and avoid SPOF where possible

Trusted


The Debian Project
Fedora Foundation
Gentoo Foundation
OpenBSD Foundation
FreeBSD Foundation
Arch Linux
Canonical
Google
Apple
Guardian Project
Tor Project
Bitcoin Core
ZCash Foundation
Cloud Native Computing Foundation
Inverse Path
Linux Foundation
Hashicorp
CoreOS

Untrusted

Obviously anything not on the Trusted list is untrusted but we will spell out
some commonly trusted entities we explicitly -never- trust, due to poor
security practices.

nixpkg
pypi
npm
maven
luarocks
snap
chocolatey
brew
macports
cargo
cabal
Google play store
Chrome Store
Apple App Store

Exceptions


You can run/evaluate untrusted code but only in dedicated VMs/hardware.
Untrusted environments must never be able to interact with production systems

Operating Systems

Criteria


Package manager must verify all packages for integrity/signatures by default
Signing must be distributed and be totally transparent.
Public reproducible build systems
Historically fast response to critical security issues
Mostly or totally open source and auditable

Trusted


Arch
Ubuntu
Gentoo
Debian
OpenBSD
FreeBSD
ChromiumOS
ChromeOS
HashbangOS
F-Droid

Untrusted


Blackberry
Windows
Windows Mobile
MacOS *
Google Android *
iOS *

Secrets

In this section "Secrets" are defined as private keys or that would allow
manipulation of customer wallets/policies or that would allow viewing must
never be exposed to system memory of any device except an approved HSM.
Required


FIDO hardware authentication must be exclusively used for supporting services

DockerHub, Github, Amazon, etc


All keys with the ability to -write- to a VCS -must- be on HSM devices

Exceptions


Services that lack hardware U2F support may be used with Yubico Authenticator
Passwords which -must- be directly visible to an application in order for it
to function, such as websites. These secrets may be maintained by a password
manager that only decrypts the exact secret being requested with explicit
approval by user with an HSM.
Secrets can be made available to automatically managed applications on
security hardened systems that humans lack direct access to, and only when use
of an HSM is not practical.

Coding Practices


Third party libraries must be reviewed and hash-lcoked in #! repos before use
A new version of a third party library requires explicit review/approval.
Automated CVE assesments on third party libraries must be present
Favor including functions instead of libraries where possible
Avoid packages that depend on low-quality dependencies like "is-even"
All project dependencies must be hash-locked with a package manager manifest
All code must be reviewed by an approved vendor.