chrisbloom7/meet_chef_notes.md

## meet_chef_notes.md

      
    Raw
  

              meet_chef_notes.md
            
          
    Meet Chef

My notes from the Meet Chef course at
http://pluralsight.com/training/Courses/TableOfContents/meet-chef
Introduction

Chef is a Ruby framework for automating, reusing and documenting server
configuration. It's like Unit tests for your servers.
One of the primary features of Chef is that its recipes are idempotent -
You can run a script several times, but it won't change anything after
the first run. If none of your inputs to Chef change, running it over
and over should not try to run all the commands over and over.
Limitations

Chef will not magically configure your server. It only runs recipes
that you specify with the inputs you give it.
It's important that you do not blindly use other Chef cookbooks or
recipes. No two companies have exactly the same server architecture.
Chef does not monitor the runtime behavior of an of the software it
configures. It can not tell you whether or not a service is running.
It is a short lived deployment service, not a continuously running one.
It was built to be run occasionally to keep a server's configuration in
a specific state.
Chef doesn't have any concept of undoing changes. It's up to us to
rollback any changes if we feel a mistake has been made. It does backup
some configuration files when it makes changes, but it doesn't have any
tools to restore these files in the event of a mistake.
We should always test our cookbooks and recipes in a virtual environment
before deploying.
Why Use Chef?

Chef makes server configuration readable, accessible, and repeatable.
In addition, automated deployments are faster than connecting via SSH
and running one command after another via the shell terminal.
Because Chef cookbooks are both reusable and idempotent, Chef can take
some of the fear and guesswork out of sysadmin work. While Chef doesn't
save you from the need to understand what you're doing as a sysadmin, it
gives you a much more friendly environment for dealing with it.
Chef recipes are a great way to learn about how the software works
rather than using reading through a lot of disparate manuals.
It's also useful for keeping development in sync with production. Doing
so minimizes deployment issues.
Part 1

Chapter 2 - Terminology

node

A node is a server, or a host. It could be for the web, database,
worker, etc.
chef client

The command line program that configures the server from the node, or
host
Chef server

stores information about the nodes. is rest-based. opscode is an
example, but could be self-hosted
chef-solo

a standalone version of the chef client that doesn't require a chef
server. point it at a recipe instead
chef recipe

A ruby file containing instructions for building a node, each executed
in order
resources

files, directories, users and services needed by a node
cookbook

a collection of recipes, and associated files, such as configuration
file templates.
role

reusable configuration across multiple nodes. I.e. a web role that
configures a server distributed over 5 nodes, or a database role that
configures slave/master database nodes.
run list

an array of recipes and roles defining what gets executed on a node
attributes

variables that are passed through chef and used in recipes and
templates, such as software version.
template

a file with placeholders for attributes
notification

when one resource is changed, it can trigger an update in another
resource. Ex: If an Nginx configuration file is updated, then
notify the nginx resource to restart
Chapter 3 - Install a Virtual Machine

We need

VirtualBox (https://www.virtualbox.org/)
Starter code (https://github.com/jsierles/peepcode-chef-recipes.git)
Vagrant (http://vagrantup.com)

The starter code includes a Vagrantfile that's setup to run a
preconfigured Debian Squeeze 64 setup. It also comes bundled with Ruby
1.9.3 and Ruby Gems 1.8.17.
vagrant up
vagrant ssh

Chapter 4 - Write an Nginx Recipe

In a new Terminal tab, from the starter code root:
cd nginx
rm -rf recipes/ templates/ attributes/ # because we want to start from scratch
mkdir recipes
mkdir templates
mkdir attributes
touch recipes/default.rb

Writing the Default Recipe

default.rb is the recipe that is run by default when no specific
recipe is specified.
The first step will be to install the nginx package. (A package is
a chef resource which relies on a package management system on the host
OS, i.e apt)
Specifying a version is only useful if you want to stop Chef from
upgrading to a later version in the future.
package "nginx" do
  version "1.0.3"
end

We can just stick with the default version, so:
package "nginx"

If we run the recipe now, chef will try to install nginx (using apt-get)
if it hasn't already been installed.
We also want to be able to start and stop nginx, so we'll install it as
a service:
package "nginx"

service "nginx"

During an update, other recipes can now determine if the state of the
nginx service.
Most unix systems run process daemons from the /etc/init.d directory.
Chef relies on the package installer (i.e. apt-get) to do this, then
sends the appropriate commands (start, stop, etc.) and assumes it will
do the right thing
We want to be able to get status, restart and reload for the nginx. And
we want it to be started when the OS boots and when chef runs. So:
package "nginx"

service "nginx" do
  supports :status => true, :restart => true, :reload => true
  action [:enable, :start]
end

enable ensures the service is started when the OS boots, start means
Chef should start the service if it isn't already.
Using Templates

The config file is where we can customize nginx, based on our config
variables (with web directory variables, etc)
We could start with nginx's default config file, since that has a lot of
settings we can use already. But... we don't have nginx installed yet.
We can install it temporarily to get the file:
sudo su
apt-get update
apt-get install nginx
cat /etc/nginx/nginx.conf


I had to run apt-get update first before I could install nginx

We can update any of these options dynamically using a chef template.
Chef templates use erb syntax. We tell chef where the template lives.
If we specify a block, we can do other things such as trigger
notifications.
Continuing in default.rb:
template "/etc/nginx/nginx.conf" do
  notifies :reload, "service[nginx]"
end

The :reload refers to reload on line 4. (This is an nginx shortcut
for reading config changes that don't require a full restart.)
service[nginx] refers to line 3 where we referred to nginx as a
service.
When chef is run, it will look for a template file named
templates/nginx.conf.erb. We could also explicitly set the source
template file within the block using source "nginx.conf.erb", but
this is unnecessary because chef uses convention-over-configuration to
determine the template file name.
We can create the template file by copying what we cated from nginx's
default config file and pasting it into nginx.conf.erb. This is good
practice to ensure we're as close to the standard setup as possible.
Before we do that, create a default folder in the templates folder.
This has nothing to do with the default.rb file we created earlier.
It refers instead to the default host server. This templates directory
can have templates customized for each host machine. For example, we
could have made a beta directory for a beta server. default will
be the directory that gets used for all hosts in this recipe.
Now we create the file there. It won't have any ERB tags in it just yet.
Chapter 5 - Run Chef

Going back to the VM we are SSH'd into, let's install chef-solo:
gem install chef --no-rdoc --no-ri

We now have the chef-solo command line utility available to us. If we
run it now, it will complain that it can't find any cookbooks or a
configuration file. We can tell chef-solo where to look for cookbooks
by setting up a config file like so:
mkdir /etc/chef
echo "cookbook_path \"/cookbooks\"" > /etc/chef/solo.rb

Now if we run chef-solo -l info it no longer complains, but we can
see that the run list (recipes or roles) is empty. Since we only want
nginx, we can pass it in directly using a json file. We can use vim /etc/chef/node.json to create this:
{
  "run_list": ["recipe[nginx]"]
}

This refers to the nginx cookbook, and the default recipe. If we had
another recipe, for example a "client" recipe, we could specify it
after a double colon:
  "run_list": ["recipe[nginx::client]"]

When writing this file, always use double-quotes, and be sure to close
all brackets and quotes. Now save the file and quit vim.
Now we tell chef that we're going to use our own run list:
chef-solo -l info -j /etc/chef/node.json


I had to add the -l info option to the command to get the same
level of output as was shown in the video.

The output looks like:
[2014-03-18T15:34:05+01:00] INFO: Forking chef instance to converge...
Starting Chef Client, version 11.10.4
[2014-03-18T15:34:05+01:00] INFO: *** Chef 11.10.4 ***
[2014-03-18T15:34:05+01:00] INFO: Chef-client pid: 1905
[2014-03-18T15:34:07+01:00] INFO: Setting the run_list to ["recipe[nginx]"] from JSON
[2014-03-18T15:34:07+01:00] INFO: Run List is [recipe[nginx]]
[2014-03-18T15:34:07+01:00] INFO: Run List expands to [nginx]
[2014-03-18T15:34:07+01:00] INFO: Starting Chef Run for vagrant-debian-squeeze-64.vagrantup.com
[2014-03-18T15:34:07+01:00] INFO: Running start handlers
[2014-03-18T15:34:07+01:00] INFO: Start handlers complete.
Compiling Cookbooks...
Converging 3 resources
Recipe: nginx::default
  * package[nginx] action install[2014-03-18T15:34:07+01:00] INFO: Processing package[nginx] action install (nginx::default line 1)
 (up to date)
  * service[nginx] action enable[2014-03-18T15:34:07+01:00] INFO: Processing service[nginx] action enable (nginx::default line 3)
 (up to date)
  * service[nginx] action start[2014-03-18T15:34:07+01:00] INFO: Processing service[nginx] action start (nginx::default line 3)
[2014-03-18T15:34:08+01:00] INFO: service[nginx] started

    - start service service[nginx]

  * template[/etc/nginx/nginx.conf] action create[2014-03-18T15:34:08+01:00] INFO: Processing template[/etc/nginx/nginx.conf] action create (nginx::default line 8)
 (up to date)
[2014-03-18T15:34:08+01:00] INFO: Chef Run complete in 0.953383215 seconds

Running handlers:
[2014-03-18T15:34:08+01:00] INFO: Running report handlers
Running handlers complete

[2014-03-18T15:34:08+01:00] INFO: Report handlers complete
Chef Client finished, 1/4 resources updated in 2.514633292 seconds

Using Attributes

As we can see in the output above, chef doesn't actually perform any
actions other than restarting the nginx service. That's because we
already have nginx installed, and our template file is exactly the same
as the nginx.conf file already in use. Let's go back to our template
and change the keepalive_timeout setting to 5. Then return to the
VM and re-run the chef-solo command. Now the output includes info
about backing up the old config file and updating the original. It even
includes a diff of the changes. If we go back and look at the live
config file we'll see our changes are now present.
This is great, but not terribly useful to pass in static values. A
better use is using attributes.
Cookbooks can define attributes for use anywhere in one of their
templates. It's always good to provide a default value for any
attribute used in your cookbooks. Note that attribute values have
different levels of precedence depending on where you set them. The
order of precedence (highest -> lowest) looks like this:
Node <- Role <- Environment <- Cookbook
Environment means the same thing here as it would in Rails:
development, test, staging, production, etc. Here's an example:
Say we want to define the number of worker processes for our servers.
We might define that number to be 4 in our cookbook since that's a safe
default. Our cookbook will also be used to setup our development
environment, so we could let that environment specify that it only
needs a single worker process. We then have a Role for setting up web
servers, and we decide that our hardware specs mean they can
comfortably handle 6 worker processes instead. Then for one of those
nodes, perhaps an older legacy server that we've yet to retire, we
don't want to risk overtaxing the CPU so we can set it to use a value
of "2" worker processes instead.
In addition, Chef has different kinds of attributes, each with their own
precedence:
Automatic <- Override <- Normal/Set <- Default
The lowest level point where we can create attributes is inside the
attributes directory of the cookbook itself.
touch attributes/nginx.rb

Some people prefer to use the name default.rb, but that can be
confusing regarding the name default. It's also easier to search the
cookbook for the name of the package (i.e. nginx).
Default Attributes

Inside this file we'll use what are called "cookbook default
attributes". It's almost always best to use "default attributes" in
practice. (If default attributes aren't powerful enough, we can explore
chef's other attributes, but it's likely we'll never need to.) Default
attributes are attributes that can be overridden at the environment or
role level.
In this file, we'll specify:
default[:nginx][:dir] = "/etc/nginx"
default[:nginx][:worker_processes] = 4

Attributes can be set using symbols (as above), or string keys, or
using dot notation. The line above could also be written as:
default["nginx"][:worker_processes] = 4

or
default.nginx[:worker_processes] = 4

It's best to choose one style and stick with it.
Attributes are embedded in templates using the standard ERB output
notation via the @node instance variable. In our nginx.conf.erb
file, we can use ours like so:
worker_processes  <%= @node[:nginx][:worker_processes] %>;

The @node object holds all of chef's attributes, whether from the
cookbook, environment, role or node.
If we run our recipe again, we'll see that our nginx.conf file gets
updated again using our attribute value.
When constructing a template, you might be tempted to replace every
possible configuration option with a matching attribute. Most of the
time you really only need to change a select few options, so try to
constrain yourself to major options.
Template Variables

There's another way to specify attributes directly in template blocks in
recipe files. It's not used very often, but here's what it would look
like if we used it in recipes/default.rb:
template "/etc/nginx/nginx.conf" do
  notifies :reload, "service[nginx]"
  variables :user => 'www-data'
end

Here we pass a hash to the variables method. Then in our template file
we'd reference it as @user:
user <%= @user %>;

Whenever possible, use default attributes rather than variables. We
should now back out these changes from our own recipe and template
files.
Advanced Configuration Using Resources

The resources and providers directories are usually not needed, but
are useful for auxiliary configurations. The directories are used to
create an LWRP - Light Weight Resource Provider. An example LWRP might
be to create a resource for managing the creation of an nginx virtual
host file. As with services and notification, we'd use resources inside
a recipe. It's a way of taking groups of resources and giving them more
functionality and integration within Chef by emulating the syntax of
built-in resources, extending the chef DSL for things we do in many
places.
Chapter 6 - Use your SSH key

Taking a break from our nginx example, we'll build a recipe to setup a
Rails app and deploy using Capistrano. This will be easier if we copy
our SSH key to the server (aka node). Copy/paste the contents of
~/.ssh/id_rsa.pub on the local machine to ~/.ssh/authorized_keys on
the VM. (Make sure you are not sudo'd) There is already one key in that
file which is what vagrant uses to authenticate us through
vagrant ssh. We don't want to use this key for any public server since
the same key is used with ALL vagrant gems, but we can leave it for
our own use.
Chapter 7 - Setup a Local Hostname

Confirm that nginx is running on the VM
ps auwx | grep nginx

We'd like to access nginx directly from our desktop web browser. Our
Vagrantfile specified a config.vm.network option that lets us
connect to the VM from our virtual host at IP address 33.33.33.10. We
can add this to our local hosts file to access the server using a more
friendly name. On the local machine, run sudo vim /etc/hosts and add
the following:
33.33.33.10 kayak.test

If we try to access http://kayak.test in our local browser, we should
now see a 404 error page from nginx. We can call this a success since
it shows that nginx is handling our requests and serving responses.
Chapter 8 - Build the Rails Recipe

Goals:

Import the Nginx and Unicorn cookbooks
Understand the existing Unicorn recipe
Create a Rails recipe
Understand metadata
Install Ruby gems
Create directories and files
Edit the run list
Create and configure templates
Reuse variables

The starter code already has recipes for Rails and Unicorns. We're going
to create the former from scratch, but leave the latter one untouched.
Looking at the Unicorn recipe we see it:


Installs the Unicorn ruby gem with the gem_package method.
We could install this using the Gemfile in the Rails app, but
we want to do it in Chef to ensure it is present.


Creates a directory for storing configuration files


Installs a cookbook file, a new type of chef resource that lets us
copy plain, static files and copy them to the node. In this case,
we're copying a Ruby script to control the Unicorn process.


We're going to write our Rails recipe from scratch:
rm -rf rails/
mkdir rails
mkdir rails/attributes rails/templates rails/recipes
touch rails/recipes/default.rb

We need our recipe to:

Run the nginx and unicorn installers
Setup a metadata file to define those dependencies
Create some directories
Configure the Unicorn app server
Configure nginx to serve this as a virtual host

Specifying Dependencies

We need chef to run the nginx and unicorn recipes first, and then use
resources from those recipes. To get Chef to do this, we add the
following to our recipes/default.rb file like so:
include_recipe "nginx"
include_recipe "unicorn"


An older syntax you might see is require_recipe, but don't use that;
it's been deprecated.

We also need to define those recipes as dependencies so that chef server
will deliver those recipes to the node when using chef client. We can
copy the one from the unicorn cookbook to use as a starting point.
cp unicorn/metadata.rb rails/

It currently looks like this:
maintainer        "Joshua Sierles"
maintainer_email  "joshua@diuvia.net"
description       "Configures unicorn"
version           "0.1"

We want to add these two lines to tell chef that it can't run without
access to the other two cookbooks:
depends           "nginx"
depends           "unicorn"

Note that we are specifying the name of a top-level cookbook, not an
individual recipe. This is for coordination with the chef server, but
you should always do it even if you're only working with chef-solo.
Installing Gems

Back in recipes/default.rb, add the following line to ensure we have
access to sqlite and the bundler gem, both of which we'll need to setup
the rails app:
package "libsqlite3-dev"
gem_package "bundler" # builds a ruby gem, not a debian package


The video left out the part about adding the "libsqlite3-dev" package,
but the original files had it, and Capistrano was failing without, so
I've added it back.

Using Ruby in Recipes

We need to create a few directories for the application itself, and for
log and config files. Recipe files are just Ruby files, so we can use
the Ruby language in chef recipes. We can add a Ruby hash for common
variables:
common = {:name => "kayak", :app_root => "/u/apps/kayak"}

We'll use this throughout the recipe, as well as in templates.

We are hardcoding the app directory here because this recipe is
specific to that Rails app

Creating Directories

The directory method creates a directory with optional attributes.
Add the following:
directory common[:app_root] do
  owner "vagrant"
  recursive true
end


This creates our root directory. The original video left out any step
that created the /u/apps folder, so I've added recursive true to
the block to create it for us if it doesn't already exist. Note that
recursive doesn't apply the owner to parent folders, but that's OK
in this case as we want those folders to be owned by root, who we
should be sudod in with.

Normally, Capistrano would symlink the latest release into common,
but we're going to customize this by using a git-based deploy that
checks out a git repo inside the current directory

The video had us creating common[:app_root]+"/common", but we don't
need it so I've left it off. The current directory will be created
by Capistrano as necessary, with vagrant as the owner, so we don't
need to do that here.

Using Ruby Metaprogramming

We need to add a few more directories. We can use a bit of Ruby
metaprogramming:

The steps in the video assumed that common[:app_root]+"/shared"
would be created recursively in the loop below (using the
recursive true command), but I found that lead to permissions errors
later when deploying with Capistrano since shared was not owned by
vagrant. (See note above about recursive permissions.) So we'll
create it explicitly here.

directory common[:app_root]+"/shared" do
  owner "vagrant"
end

%w(config log tmp sockets pids).each do |dir|
  directory "#{common[:app_root]}/shared/#{dir}"
  recursive true # create parent directories as needed
  mode 0755
end


We're leaving an intentional bug in our code so we can debug it. Also,
we won't use that recursive statement. It will be replaced below.

Testing your Recipe

Chef is to servers as Unit tests are to code, and like unit tests you
should run your chef recipes often along the way to help you catch
errors along the way.
The last time we ran chef-solo on the VM we specified a run list using
JSON. We want to update that to specify the new recipe we're building.
sudo su
vim /etc/chef/node.json

Then inside that file:
{
  "run_list": ["recipe[rails]"]
}

Note that we don't need to add it and keep the old recipe since the new
recipe includes the old recipe as a dependency. The downside to this is
that simply looking at a runlist doesn't always tell you all of the
recipes that will be run since each recipe might run other recipes.
We also need to setup our apps directory on the VM
mkdir /u
mkdir /u/apps

Now we can run our cookbook on the VM:
chef-solo -l info -j /etc/chef/node.json

Debugging Recipes

This time we see an error in the output:
================================================================================
Recipe Compile Error in /cookbooks/rails/recipes/default.rb
================================================================================


NoMethodError
-------------
No resource or method named `recursive' for `Chef::Recipe "default"'


Cookbook Trace:
---------------
  /cookbooks/rails/recipes/default.rb:18:in `block in from_file'
  /cookbooks/rails/recipes/default.rb:16:in `each'
  /cookbooks/rails/recipes/default.rb:16:in `from_file'


Relevant File Content:
----------------------
/cookbooks/rails/recipes/default.rb:

 11:  
 12:  directory common[:app_root]+"/common" do
 13:    owner "vagrant"
 14:  end
 15:  
 16:  %w(config log tmp sockets pids).each do |dir|
 17:    directory "#{common[:app_root]}/shared/#{dir}"
 18>>   recursive true # create parent directories as needed
 19:    mode 0755
 20:  end
 21:  

Fix the code by passing in a block:
%w(config log tmp sockets pids).each do |dir|
  directory "#{common[:app_root]}/shared/#{dir}" do
    owner "vagrant"
    mode 0755
  end
end

Run chef-solo again, and it works! The output should include the
following (snipped) confirmations:
Recipe: nginx::default
  * package[nginx] action install[2014-03-18T19:34:53+01:00] INFO: Processing package[nginx] action install (nginx::default line 1)
 (up to date)
  * service[nginx] action enable[2014-03-18T19:34:53+01:00] INFO: Processing service[nginx] action enable (nginx::default line 3)
 (up to date)
  * service[nginx] action start[2014-03-18T19:34:53+01:00] INFO: Processing service[nginx] action start (nginx::default line 3)
 (up to date)
  * template[/etc/nginx/nginx.conf] action create[2014-03-18T19:34:54+01:00] INFO: Processing template[/etc/nginx/nginx.conf] action create (nginx::default line 8)
 (up to date)
Recipe: unicorn::default
  * gem_package[unicorn] action install[2014-03-18T19:34:54+01:00] INFO: Processing gem_package[unicorn] action install (unicorn::default line 1)
 (up to date)
  * directory[/etc/unicorn] action create[2014-03-18T19:34:54+01:00] INFO: Processing directory[/etc/unicorn] action create (unicorn::default line 5)
 (up to date)
  * cookbook_file[/usr/local/bin/unicornctl] action create[2014-03-18T19:34:54+01:00] INFO: Processing cookbook_file[/usr/local/bin/unicornctl] action create (unicorn::default line 9)
 (up to date)
Recipe: rails::default
  * gem_package[bundler] action install[2014-03-18T19:34:54+01:00] INFO: Processing gem_package[bundler] action install (rails::default line 4)
 (up to date)
  * directory[/u/apps/kayak] action create[2014-03-18T19:34:54+01:00] INFO: Processing directory[/u/apps/kayak] action create (rails::default line 8)
[2014-03-18T19:34:54+01:00] INFO: directory[/u/apps/kayak] created directory /u/apps/kayak

    - create new directory /u/apps/kayak[2014-03-18T19:34:54+01:00] INFO: directory[/u/apps/kayak] owner changed to 1000

    - change owner from '' to 'vagrant'

  * directory[/u/apps/kayak/common] action create[2014-03-18T19:34:54+01:00] INFO: Processing directory[/u/apps/kayak/common] action create (rails::default line 13)
[2014-03-18T19:34:54+01:00] INFO: directory[/u/apps/kayak/common] created directory /u/apps/kayak/common

    - create new directory /u/apps/kayak/common[2014-03-18T19:34:54+01:00] INFO: directory[/u/apps/kayak/common] owner changed to 1000

    - change owner from '' to 'vagrant'

  * directory[/u/apps/kayak/shared/config] action create[2014-03-18T19:34:54+01:00] INFO: Processing directory[/u/apps/kayak/shared/config] action create (rails::default line 18)
[2014-03-18T19:34:54+01:00] INFO: directory[/u/apps/kayak/shared/config] created directory /u/apps/kayak/shared/config

    - create new directory /u/apps/kayak/shared/config[2014-03-18T19:34:54+01:00] INFO: directory[/u/apps/kayak/shared/config] mode changed to 755

    - change mode from '' to '0755'

  * directory[/u/apps/kayak/shared/log] action create[2014-03-18T19:34:54+01:00] INFO: Processing directory[/u/apps/kayak/shared/log] action create (rails::default line 18)
[2014-03-18T19:34:54+01:00] INFO: directory[/u/apps/kayak/shared/log] created directory /u/apps/kayak/shared/log

    - create new directory /u/apps/kayak/shared/log[2014-03-18T19:34:54+01:00] INFO: directory[/u/apps/kayak/shared/log] mode changed to 755

    - change mode from '' to '0755'

  * directory[/u/apps/kayak/shared/tmp] action create[2014-03-18T19:34:54+01:00] INFO: Processing directory[/u/apps/kayak/shared/tmp] action create (rails::default line 18)
[2014-03-18T19:34:54+01:00] INFO: directory[/u/apps/kayak/shared/tmp] created directory /u/apps/kayak/shared/tmp

    - create new directory /u/apps/kayak/shared/tmp[2014-03-18T19:34:54+01:00] INFO: directory[/u/apps/kayak/shared/tmp] mode changed to 755

    - change mode from '' to '0755'

  * directory[/u/apps/kayak/shared/sockets] action create[2014-03-18T19:34:54+01:00] INFO: Processing directory[/u/apps/kayak/shared/sockets] action create (rails::default line 18)
[2014-03-18T19:34:54+01:00] INFO: directory[/u/apps/kayak/shared/sockets] created directory /u/apps/kayak/shared/sockets

    - create new directory /u/apps/kayak/shared/sockets[2014-03-18T19:34:54+01:00] INFO: directory[/u/apps/kayak/shared/sockets] mode changed to 755

    - change mode from '' to '0755'

  * directory[/u/apps/kayak/shared/pids] action create[2014-03-18T19:34:54+01:00] INFO: Processing directory[/u/apps/kayak/shared/pids] action create (rails::default line 18)
[2014-03-18T19:34:54+01:00] INFO: directory[/u/apps/kayak/shared/pids] created directory /u/apps/kayak/shared/pids

    - create new directory /u/apps/kayak/shared/pids[2014-03-18T19:34:54+01:00] INFO: directory[/u/apps/kayak/shared/pids] mode changed to 755

    - change mode from '' to '0755'

Configuring Unicorn

Now we need to make a Unicorn config file. We'll base it on a template.
The unicorn cookbook defines attributes that we can access here. We
access them via the standard node object, but this time without the
leading "@" sigil.

It's good practice when writing Chef recipes to publish default config
locations to other cookbooks by storing them as attributes within the
corresponding cookbook.

# recipes/default.rb
template "#{node[:unicorn][:config_path]}/#{common[:name]}.conf.rb" do
  mode 0644 # readable/writeable by owner, readable by others, not executable
end

We're storing the config file on the server under the application name
(common[:name]), but that could be confusing to have several files in
this cookbook named kayak.conf.rb - exactly what is being configured?
We can specify a specific template source name that is more obvious:
# recipes/default.rb
template "#{node[:unicorn][:config_path]}/#{common[:name]}.conf.rb" do
  mode 0644 # readable/writeable by owner, readable by others, not executable
  source "unicorn.conf.erb"
end

Using a generic name (unicorn versus kayak) means we should also use
this cookbook to configure other rails applications, and we'd only need
to change the application name in one place.
It would be handy to have access to the common variables within the
template:
# recipes/default.rb
template "#{node[:unicorn][:config_path]}/#{common[:name]}.conf.rb" do
  mode 0644 # readable/writeable by owner, readable by others, not executable
  source "unicorn.conf.erb"
  variables common
end

Remember that variables accepts a hash, and common is a hash, so now
we have access to @name and @app_root within our template.
To create the template:
mkdir rails/templates/default
# Shortcut -v
git checkout rails/templates/default/unicorn.conf.erb

The unicorn.conf.erb file contains some interesting syntax:
app_root = "<%= @app_root %>"

worker_processes 10
working_directory "#{app_root}/current"
preload_app true
timeout 300
listen "#{app_root}/shared/sockets/unicorn.sock", :backlog => 2048

pid "#{app_root}/shared/pids/unicorn.pid"
stderr_path "#{app_root}/shared/log/unicorn.log"
stdout_path "#{app_root}/shared/log/unicorn.log"

if GC.respond_to?(:copy_on_write_friendly=)
  GC.copy_on_write_friendly = true
end

# handle zero-downtime restarts

before_fork do |server, worker|
  old_pid = "#{server.config[:pid]}.oldbin"
  if old_pid != server.pid
    begin
      sig = (worker.nr + 1) >= server.worker_processes ? :QUIT : :TTOU
      Process.kill(sig, File.read(old_pid).to_i)
    rescue Errno::ENOENT, Errno::ESRCH
    end
  end
end

after_fork do |server, worker|

  # set process title to application name and git revision
  revision_file = "#{Rails.root}/REVISION"
  if ENV['RAILS_ENV'] != 'development' && File.exists?(revision_file)
    ENV["UNICORN_PROCTITLE"] = "<%= @name || "unicorn" %> " + File.read(revision_file)[0,6]
    $0 = ENV["UNICORN_PROCTITLE"]
  end

  # reset sockets created before forking
  ActiveRecord::Base.establish_connection
end

before_exec do |server|
  Dir.chdir("#{app_root}/current")
end

Note that on line 1 we're accessing a instance variable named
@app_root, but on line 4 we're accessing a local variable named
app_root. This is because the unicorn configuration file is
itself a Ruby script! So we're using Ruby (ERB) to write Ruby (the
.rb config file).

This is called "passive code generation"

Also on line 35 we're using the @name variable, but setting a default
value in case it isn't defined.
ENV["UNICORN_PROCTITLE"] = "<%= @name || "unicorn" %> " + File.read(revision_file)[0,6]

We know that we can set default values in the recipe file, or using
default attributes. Setting default values in the template make
attributes more bullet proof, but it comes at the cost of possibly
spreading defaults over several different files. Keep this in mind as
you develop a strategy for writing new cookbooks and templates.
Now that the template is in place, let's run chef again:
chef-solo -l info -j /etc/chef/node.json

We can see from the output that the template file has been written out
to the server. We can confirm using vim /etc/unicorn/kayak.conf.rb.
Configuring Nginx

Now we need to configure the nginx configuration template for this
application's virtual host. Nginx can serve many websites at once; we
need to create a config file specific for this application that tells
nginx how to handle requests for http://kayak.test. Nginx is pre-
configured to look for configuration files in the
/etc/nginx/sites-available/ folder.
# recipes/default.rb
nginx_config_path = "/etc/nginx/sites-available/#{common[:name]}.conf"

template nginx_config_path do
  mode 0644
  source "nginx.conf.erb"
  variables common.merge(:server_names => "kayak.test")
  notifies :reload, "service[nginx]"
end

To create the template:
# Shortcut -v
git checkout rails/templates/default/nginx.conf.erb

The nginx.conf.erb file contains:
upstream <%= @name %> {
  server unix:/u/apps/<%= @name %>/shared/sockets/unicorn.sock fail_timeout=0;
}

server {

  listen 80;
  server_name <%= @server_names %>;

  root /u/apps/<%= @name %>/current/public;

  access_log	/u/apps/<%= @name %>/shared/log/access.log combined;
  error_log   /u/apps/<%= @name %>/shared/log/error.log;

  location / {

    if (-f $request_filename) {
       break;
     }

     if (!-f $request_filename) {
       proxy_pass http://<%= @name %>;
       break;
     }
   }

  error_page  500 502 503 504 /500.html;
  error_page  404 /404.html;

  location = /500.html {
    root  /u/apps/<%= @name %>/current/public;
  }

  location = /400.html {
    root  /u/apps/<%= @name %>/current/public;
  }
}

As you can see, we're using the @name and @server_name variables to
configure this instance. We can go one step further and replace all
instances of /u/apps/<%= @name %> to <%= @app_root %>.
We have one more step: enable this site by linking it to the
sites-enabled directory provided by Debian. We'll use the nginx_site
resource defined by the nginx cookbook. (See nginx/resources/site.rb
[where the resource is defined] and nginx/providers/site.rb [where the
resource actions are defined])
# recipes/default.rb
nginx_site "kayak" do
  config_path nginx_config_path
  action :enable
end

The nginx_site resource knows that the server should be reloaded
after this is done, but we won't see any reload notification in Chef's
log messages because this is a LWRP (Light Weight Resource Provider) and
for better or worse they don't log all of their actions.
Finishing our Rails Server setup

Now we can run chef-solo. If we were to run it again, we can see
that chef doesn't do anything else because it knows it doesn't need to.
Even though we've done all these steps, if we visit http://kayak.test
in our local browser we still get a 404 error. This is because we
haven't actually deployed our application yet.
Part 2

Chapter 9 - Deploy Rails

Clone a preexisting Rails app from github into a new folder:
git clone https://github.com/jsierles/kayak.git

Why are we using a 3rd-party app to deploy when we've already got chef?
We use a special chef resource named chef-deploy. It could be used
for first time deployment, but it's slower than using Capistrano. It's
also more sensitive to errors - if another chef recipe fails along the
way, your rails app will never get deployed.
deploy.rb

Look at the config/deploy.rb script:
require "bundler/capistrano"
require 'fast_git_deploy/enable'

set :application, "kayak"
set :repository,  "https://github.com/jsierles/kayak.git"
set :deploy_to, "/u/apps/#{application}"
set :scm, :git

set :user, "vagrant"
set :branch, "master"
set :deploy_type, 'deploy'
set :use_sudo, false

default_run_options[:pty] = true
ssh_options[:forward_agent] = true
ssh_options[:keys] = [File.join(ENV["HOME"], ".vagrant.d", "insecure_private_key")]

role :app, "kayak.test"
role :web, "kayak.test"
role :db,  "kayak.test", :primary => true

after "deploy:setup" do
  deploy.fast_git_setup.clone_repository
  run "cd #{current_path} && bundle install"
end

namespace :unicorn do
  desc "Start unicorn for this application"
  task :start do
    run "cd #{current_path} && bundle exec unicorn -c /etc/unicorn/kayak.conf.rb -D"
  end
end

namespace :deploy do
  task :create_symlink do
  # no-op to remove default symlink task, not needed by fast_git_deploy
  end
end

Note that on line 2 we're using fast_git_deploy which works a little
differently than the standard Capistrano deploy task. It is recommended
as a faster alternative to the standard deployment task.
On line 7 we set the user to "vagrant" since that user already exists.
On line 16 an ssh option is set to use the vagrant demo key since this
is a private deployment. This works since the key is installed on our
VM, but this isn't what you'd want to use for a production deployment.
We could just remove that line and we'd be authenticated instead as the
current user.
On lines 18-21 we set the target server for all roles to "kayak.test",
which we setup before in /etc/hosts.
On line 27 we have a helper task that starts the unicorn server using
the unicorn config file we created in our rails recipe. Note that a
production app would likely use a process manager to handle unicorn
processes, but for this demo we'll just run it manually.

On line 35 it originally used task :symlink, but we must be using a
newer version of Capistrano because it is instead calling
deploy::create_symlink. I've updated the code above to override that
task instead.

Deploying with Capistrano

First we need to install Capistrano. On the local machine:
cd kayak
gem install bundler
bundle install

Now we'll deploy using deploy:cold since this is a fresh server and
it's the first time we've deployed this application. It will:

Create the applications folder at /u/apps/kayak/current
Checkout the latest revision from git

From the local machine:
bundle exec cap deploy:cold

After a few seconds you should see it complete with
 ** transaction: commit
  * executing `deploy:restart'

Starting the server

But if we try to load http://kayak.test in our browser we still get an
error. We need to start Unicorn on the server. We can do that via the
unicorn:start Capistrano task we created on line 27 of
config/deploy.rb:
bundle exec cap unicorn:start

In a production app we'd use a daemon monitoring script (bluepill,
monit, etc) to make sure Unicorn was up, and stayed up.
Chapter 10 - Attributes

Attributes are variables or parameters given to Chef and are used for
giving Chef instructions. They are passed in from the server, and from
recipes themselves. They are great places for storing configuration
values.
In the chapter on nginx, we setup default attributes and used them in
our template for configuring the nginx service:
default[:nginx][:dir] = "/etc/nginx"
default[:nginx][:worker_processes] = 4

Chef's attributes are complex - the result of historical design
decisions in early versions of Chef.
We know of three types of attributes:

default
normal
override

Right now we're only concerned with default and normal attributes.
Overrides are rarely required, usually in emergency situations when
we're not sure where an attribute value is coming from and we need to
explicitly override the value to get something to work.
Default attributes are the attributes we define in the code inside our
cookbooks. Normal attributes are defined either in the role or the
runlist file on the chef client.
User Account Cookbook

To demonstrate these two types of attributes, we'll setup a new cookbook
that will create user accounts on the client. For each user, we'll
create the user account and setup ssh keys for them to log into the
server with.
As with earlier exercises, there's already a completed users cookbook
in the sample code. We'll start by emptying the recipes folder on the
local computer:
cd users
rm recipes/default.rb
touch recipes/default.rb

As with the rails recipes we wrote, we'll pull in other completed
recipes to complete the tasks in this one. We'll use the ruby-shadow
recipe which gives us the ability to work with unix passwords from Ruby.
# users/recipes/default.rb
require_recipe 'ruby-shadow'

Looking at the ruby-shadow cookbook in TextMate, we see three
resources that we haven't seen before:

remote_directory copies an entire directory from the cookbook to
the remote server. Similar to cookbook_file from the unicorn
recipe, which copied a single file to the node. (remote_file is
another resource that copies files from remote URLs.) The source
method points to the directory that should be copied, relative to
the cookbook's files/default directory.
not_if is a "meta resource" or "meta command", which is useful for
any type of resource. This one will cause the remote_directory
command to be skipped if the files already exist on the remote node.
This is important for keeping this cookbook idempotent.
The bash resource will run a shell command with the options passed
in the block.

We'll start with one user, but we can imagine that we may have more
users to setup in the future. First we'll define our user attributes.
We can use a runlist in the root of our cookbooks directory:
# run_list.json
{
  "run_list": ["rails", "users"],
  "users": {
    "joshua": {
      "ssh_keys": {"mypublickey": "a_long_key_value"},
      "password": "$1$rIVP8MzN$C6A/X26wSngSNIJNLjjzc/"
    }
  }
}

To generate the encrypted password, which will work with ruby-shadow and
won't expose sensitive data, we can use the openssl command in unix:
openssl passwd -1 mypass

These attributes will be passed into the recipe when we run the recipe
using chef-solo. They will be available via the node method. This
method gives us access not only to these attributes, but also to all
other attributes from the underlying attributes, called "underlying
attributes".

As we saw when setting attributes in Chapter 5, attributes can be
referenced using symbols (as below), or string keys, or using dot
notation. This type of object is called a Mash, meaning a
syntactically flexible data structure.

# users/recipes/default.rb
require_recipe 'ruby-shadow'

node[:users].each do |name, conf|
  home_dir = "/home/#{name}"
  
  user name do
    password conf[:password]
    action [:create]
  end
  
  directory home_dir do
    owner name
    mode 0700
  end
  
  directory "#{home_dir}/.ssh" do
    owner name
    mode 0700
  end
  
  template "#{home_dir}/.ssh/authorized_keys" do
    owner name
    variables keys: conf[:ssh_keys]
    mode 0600
  end
end

Chef ships with a built-in user resource, which we see above. We pass
in a user name and provide a password and an action. Other actions are
available, but this is all we need for our purposes.
The template itself is already available in
users/templates/default/authorized_keys.erb.
<% @keys.each do |name, key| %>
# <%= name %>
<%= key %>
<% end %>

With everything in place, we can run our recipe on the VM. Since our
cookbooks directory is mirrored in the VM as /cookbooks, we can
reference our runlist directly
chef-solo -l info -j /cookbooks/run_list.json

We can verify this by changing to the "joshua" user on the VM:
su - joshua
pwd      
# => /home/joshua
cat .ssh/authorized_keys
# => # mypublickey
# => a_long_key_value

Chapter 11 - Ohai Automatic Attributes

ohai is a command-line tool for builds a data structure of automatic
attributes from your system.
Automatic attributes behave like all other attributes in the chef
system, except they are read-only. When chef was installed on the node
it installed another gem named ohai. We can test this on the VM:
ohai

The output is a huge json list of attributes:
{
  "languages": {
    "ruby": {
      "platform": "x86_64-linux",
      "version": "1.9.3",
      "release_date": "2012-02-16",
      "target": "x86_64-unknown-linux-gnu",
      "target_cpu": "x86_64",
      "target_vendor": "unknown",
      "target_os": "linux",
      "host": "x86_64-unknown-linux-gnu",
      "host_cpu": "x86_64",
      "host_os": "linux-gnu",
      "host_vendor": "unknown",
      "bin_dir": "/usr/local/bin",
      "ruby_bin": "/usr/local/bin/ruby",
      "gems_dir": "/usr/local/lib/ruby/gems/1.9.1",
      "gem_bin": "/usr/local/bin/gem"
    },
    "perl": {
      "version": "5.10.1",
      "archname": "x86_64-linux-gnu-thread-multi"
    },
    "python": {
      "version": "2.6.6",
      "builddate": "Dec 26 2010, 22:31:48"
    }
  },
  # ...

We can filter them like this:
ohai ipaddress
# => [
# =>   "10.0.2.15"
# => ]

This will access the top level of the json hash. The hash will be merged
into all the other attributes on every chef run. If you use chef-server,
these attributes will be stored on the server so you can query your
server farm by their attributes. Even for a single server, this is still
very useful information for chef internally. Chef uses it to figure out
how to install the right packages based on the platform architecture,
find out what version of a package is installed, etc.
Inside your recipes, you can reference these through the node method.
For an example look at ruby-shadow/recipes/default.rb:
not_if { File.exists?("#{node[:languages][:ruby][:bin_dir].gsub(/bin$/, "lib/ruby/site_ruby/1.9.1/")}#{node[:languages][:ruby][:platform]}/shadow.so") }

node[:languages][:ruby][:bin_dir] is an attribute defined by the
ohai application.
We can see the raw output of that attributes:
ohai languages
# => {
# =>   "ruby": {
# =>     "platform": "x86_64-linux",
# =>     "version": "1.9.3",
# =>     "release_date": "2012-02-16",
# =>     "target": "x86_64-unknown-linux-gnu",
# =>     "target_cpu": "x86_64",
# =>     "target_vendor": "unknown",
# =>     "target_os": "linux",
# =>     "host": "x86_64-unknown-linux-gnu",
# =>     "host_cpu": "x86_64",
# =>     "host_os": "linux-gnu",
# =>     "host_vendor": "unknown",
# =>     "bin_dir": "/usr/local/bin",
# =>     "ruby_bin": "/usr/local/bin/ruby",
# =>     "gems_dir": "/usr/local/lib/ruby/gems/1.9.1",
# =>     "gem_bin": "/usr/local/bin/gem"
# =>   },
# =>   "perl": {
# =>     "version": "5.10.1",
# =>     "archname": "x86_64-linux-gnu-thread-multi"
# =>   },
# =>   "python": {
# =>     "version": "2.6.6",
# =>     "builddate": "Dec 26 2010, 22:31:48"
# =>   }
# => }

Chapter 12 - Roles

What we've learned so far is great for setting up a single server. But
what if we have different servers with different specs and uses? This
is where chef roles come in.
Create a roles directory in our cookbooks directory. Normally we
shouldn't store roles directly alongside cookbooks, because roles aren't
part of cookbooks. But we're going to put it here for our demo because
this directory is already mounted on the VM.
Create a file in new directory named appserver.json. We'll imagine
that we have an application server that we may have 3 or 4 of. We want
the same attributes to appear on all app servers. Roles can also be
created as Ruby files, but we'll stick with JSON for now.
# roles/appserver.json
{
  "name": "appserver",
  "description": "Rails application server",
  "run_list": ["recipe[rails]", "recipe[users]"],
  "default_attributes": {
    
  },
}

We know that there is a hierarchy of attributes. In the run_list.json
file we set normal attributes at the root of the data structure. Node
attributes will override those coming from the role if the same keyname
is used for any attribute. Let's copy the "users" hash from that file
and paste it into appserver.json:
# roles/appserver.json
{
  "name": "appserver",
  "description": "Rails application server",
  "run_list": ["recipe[rails]", "recipe[users]"],
  "default_attributes": {
    "users": {
      "joshua": {
        "ssh_keys": {"mypublickey": "a_long_key_value"},
        "password": "$1$rIVP8MzN$C6A/X26wSngSNIJNLjjzc/",
      },
    },
  },
}

Any server that uses this role will automatically have access to this
hash of users. In addition to attributes, we can also specify a run list
for all of our app servers. We can also nest run lists inside other run
lists - for instance, a role can include another role.
We can now remove the node's runlist and tell it to use the role's
runlist instead.
# run_list.json
{
  "run_list": ["role[appserver]"],
}

Since we're using JSON and not Ruby to define our appserver role, we
need to add one more line:
# roles/appserver.json
{
  "json_class": "Chef::Role",
  "name": "appserver",
  "description": "Rails application server",
  "run_list": ["recipe[rails]", "recipe[users]"],
  "default_attributes": {
    "users": {
      "joshua": {
        "ssh_keys": {"mypublickey": "a_long_key_value"},
        "password": "$1$rIVP8MzN$C6A/X26wSngSNIJNLjjzc/",
      },
    },
  },
}

"json_class" connects this data with the corresponding Ruby class for
roles. It won't work without this.
Now we're ready to tell chef-solo about the role. Edit the chef-solo
configuration file on the VM
vim /etc/chef/solo.rb

We'll add a role path to point to the roles directory:
cookbook_path "/cookbooks"
role_path "/cookbooks/roles"

Now we're ready to run chef-solo with our modified run list:
chef-solo -l info -j /cookbooks/run_list.json

Attribute Hierarchy

It's important to understand how attributes interact along the hierarchy
of cookbook, environment, role and node.
Let's update one of our nodes to use different SSH keys. Since we've
already set the default attributes on the role for the "joshua" user,
let's copy the "users" hash back to the original run list:
# run_list.json
{
  "run_list": ["role[appserver]"],
  "users": {
    "joshua": {
      "ssh_keys": {
        "mypublickey": "different_value",
        "another_key": "new_value",            
      },
      "password": "$1$rIVP8MzN$C6A/X26wSngSNIJNLjjzc/",
    },
  },
}

These will be merged together by Chef using a deep merge. Back on the
VM:
chef-solo -l info -j /cookbooks/run_list.json

The rendered template for authorized_keys has been backed up and re-
written.
cat ~joshua/.ssh/authorized_keys
# => # mypublickey
# => different_value
# => # another_key
# => new_value

Multiple Roles

Roles aren't limited to a single role. Let's say we have a small server
farm, and we want to define an app server, a database server, and a
background job worker server. If we don't have many machines, all three
could use the same machine. Or we could nest roles so that one role
defers some or all of its behavior to another role.
Let's make a big app server that can run many more nginx worker
processes than a normal app server. We'll call this role bigappserver
touch roles/bigappserver.json

In a text editor:
# roles/bigappserver.json
{
  "json_class": "Chef::Role",
  "name": "bigappserver",
  "description": "Big rails application server",
  "run_list": ["role[appserver]"],
  "default_attributes": {
    "nginx": {
      "worker_processes": 10
    }
  }
}

"run_list": ["role[appserver]"] is an instance of a nested role. Chef
will first run the appserver role, then the bigappserver role.
To use our new role, just update the run list:
# run_list.json
{
  "run_list": ["role[bigappserver]"],
  "users": {
    "joshua": {
      "ssh_keys": {
        "mypublickey": "different_value",
        "another_key": "new_value"
      },
      "password": "$1$rIVP8MzN$C6A/X26wSngSNIJNLjjzc/"
    }
  }
}

On the VM:
chef-solo -l info -j /cookbooks/run_list.json

We see that the nginx config file has changed and the service was
reloaded. Did it work?
head /etc/nginx/nginx.conf
# => user www-data;
# => worker_processes  10;
# => 
# => error_log  /var/log/nginx/error.log;
# => pid        /var/run/nginx.pid;
# => 
# => events {
# =>     worker_connections  1024;
# =>     # multi_accept on;
# => }

Chapter 13 - Files and Directories

We'll examine cookbook_file and remote_directory - two useful
resources. One works with files, one with directories, but their
operation is essentially the same. (Their names are different for
historical reasons.)
cookbook_file takes a file out of the cookbook's files/default
folder and copies it to the node. Example:
# unicorn/recipes/default.rb
cookbook_file "/usr/local/bin/unicornctl" do
  mode 0755
end

What if we have several scripts and we'd like to install them all in a
directory. We can use remote_directory to do this without having to
specify every file name. Let's create a new folder from our unicorn
recipe
# unicorn/recipes/default.rb
# ...
remote_directory "/usr/local/myscripts" do
  files_mode 0755
end

files_mode specifies the permissions on the files in the directory,
not the directory itself.
Create the files directory:
cd unicorn/files/default
mkdir myscripts
touch myscripts/script1
touch myscripts/script2

Now in the VM:
chef-solo -l info -j /cookbooks/run_list.json
ls -Al /usr/local/myscripts/
# => -rwxr-xr-x 1 root staff 0 Mar 19 20:04 script1
# => -rwxr-xr-x 1 root staff 0 Mar 19 20:04 script2

Chapter 14 - Debugging and Distribution

Debugging Chef

Let's say we're debugging and running into problems. Chef has a special
debug log:
chef-solo -h

This shows us available options. We see that the -l option lets us
specify the logger output level.
chef-solo -l debug -j /cookbooks/run_list.json

The output is much more verbose than the normal run.
There's also a new feature: whyrun mode. This mode attempts to show
what would be changed without executing a real run.

Note that in the video this option is referred to as --whyrun, but
this was not a valid option on my install.

chef-solo --why-run
# => Starting Chef Client, version 11.10.4
# => Compiling Cookbooks...
# => Converging 0 resources
# => 
# => Running handlers:
# => Running handlers complete
# => 
# => Chef Client finished, 0/0 resources would have been updated

Because recipes are interdependent and dependent on other parts of the
system, such as running services, installed packages, or compiled
libraries, there's always a possibility that this option won't work for
certain resources or recipes.
To demonstrate, change the SSH key in our run list:
{
  "run_list": ["role[bigappserver]"],
  "users": {
    "joshua": {
      "ssh_keys": {
        "mypublickey": "yet_another_value",
        "another_key": "new_value"
      },
      "password": "$1$rIVP8MzN$C6A/X26wSngSNIJNLjjzc/"
    }
  }
}

Then on the VM:
chef-solo -l info -j /cookbooks/run_list.json --why-run

It finishes with:
Chef Client finished, 1/26 resources would have been updated

Looking at the rest of the output shows us:
    - Would update content in file /home/joshua/.ssh/authorized_keys from ffab71 to 0893ed
        --- /home/joshua/.ssh/authorized_keys	2014-03-19 19:42:09.000000000 +0100
        +++ /tmp/chef-rendered-template20140319-8977-12jmtwh	2014-03-19 20:18:10.000000000 +0100
        @@ -1,5 +1,5 @@
         # mypublickey
        -different_value
        +yet_another_value
         # another_key
         new_value

As useful as automating things is, one of the worst things is automating
a mistake and repeating that mistake across servers! So this option is
useful for previewing changes before committing them.
Distributing

Remote run lists let you pull cookbooks and run lists from remote URLs
instead of from the local file system. So far we've had the convenience
of running recipes from a folder mounted on our VM. But in production,
we need another way to access run lists and cookbooks.
One solution is to write your own web service that stores your run lists
and can serve the proper JSON run list for each node. This is one way
that EngineYard hosting service originally setup their cloud service
using chef-solo and an application that would generate run lists for
servers.
To do this, we need to serve a run list from a URL. Let's just use our
kayak server to do this. On the VM:
cp /cookbooks/run_list.json /u/apps/kayak/current/public/

Since kayak.test is a fake domain name, we also need to tell this node
about it.
vim /etc/hosts

Edit the first line to add an alias to localhost:
127.0.0.1       localhost kayak.test
Now to run chef-solo we just need to specify a URL instead of a file.
chef-solo -l info -j http://kayak.test/run_list.json

It's good practice to use an encrypted SSL source for security, along
with authentication. You can also use the -r flag in chef-solo for
retrieving gzipped run list files from a remote URL.