Navigation Menu

Skip to content

Instantly share code, notes, and snippets.

@xuorig
Created October 7, 2016 18:18
Show Gist options
  • Star 0 You must be signed in to star a gist
  • Fork 0 You must be signed in to fork a gist
  • Save xuorig/85c04a97d85e16756338638b9b2db1a3 to your computer and use it in GitHub Desktop.
Save xuorig/85c04a97d85e16756338638b9b2db1a3 to your computer and use it in GitHub Desktop.
RPM New Relic

NewRelic's RPM

Main classes/modules

  • NewRelic::Agent
  • NewRelic::Agent::Agent
  • NewRelic::Agent::EventListener
  • NewRelic::Agent::Harvester
  • NewRelic::Agent::EventLoop
  • NewRelic::Agent::Threading::AgentThread

At boot, when using Unicorn, RPM will defer the start of the worker thread and instead wait for a connection to a Unicorn worker before start the thread. (I'm Assuming this is because you want to be sure it's going to be in a worker process and not master)

SCHEDULING, AND THE WORKER THREAD

Init Process

  • The main Agent creates a new instance of Harvester which takes the event listener as a Parameter.
  • in initialize it adds the start_transacation event to the listener to know when a new connection comes in.
  • When the event listener gets a notify for start_transaction it calls back on_connection on Harvester. Which starts the Agent Thread. by calling agent.after_fork
  • agent.after_fork Creates an EventLoop and subscribes to a bunch of events.
  • That EventLoop is ran inside an AgentThread which seems to be a wrapper around a "Simple" Ruby Thread.
  • Before starting the thread RPM connects to NewRelic servers

...

The Worker Agent / Event Loop

  • Inside that thread is an Event Loop running and collecting events.

def on

We can listen to events by using

loop.on(:event) do
  stuff
end
  • on's behaviour is quite meta: It fires an event on itself. An __add_event event :o.

  • fire basically adds an event to be processed inside a queue.

When we call run on the EventLoop it will start processing anything that's added. The first events it will see are __add_event events for which the callback is already defined.

You can guess what it does: setting up a new event, callback pair in subscribers.

Reporting events

Let's only take report_data event for example. We call

loop.on(:report_data) do
  transmit_data
end
  • This will have an effect of adding a new subscriber on the event loop. Next time we fire report_data, the callback will be called.

  • There's an other meta event we can add in the loop. It's called __add_timer. Add timer, as the name says, adds a timer to our loop.

  • Timers are used to send data every X amount of time. Sending the data over wire can take quite a lot of time so we dont want to bombard our server with possibly millions of event. The time we wait is configurable.

Processing events

  • The loop uses an infinite loop to process new events that are added to the event queue.
  • It uses IO.select with a timeout which is equal to the amount of time before the next timer should fire.
  • Now that we know a timer is ready to be fired, we fire all timers which will add an event to the queue when fired.
  • In the next line events are all popped of the queue, and their callbacks are fired too!

The Self Pipe Trick

  • Now we talked how on basically fires a meta event to add a new event + callback pair. What if a user set a really long timer time ? What if no timers are added ? The queue could wait indefinitely because it has no idea we're ready to process a new event. RPM uses the self_pipe trick to solve that problem. When we fire an event, the wakeup method is called which sends a byte of a IO.pipe that the loop created. Using IO.select we can listen for that pipe and unblock when we know fire has been called. This way we can process the __add_event event!

CONSUMING AND SENDING DATA OVER WIRE

Key Classes

  • NewRelic::Agent::NewRelicService (Speaks to New Relic Servers)
  • When asked to send data, tries and harvest from "containers"

Example Error DATA:

  • NewRelic::Agent::ErrorCollector -> NewRelic::Agent::ErrorTraceAgregator
  • Error Trace Agregator is basically just a queue with a Mutex. When you harvest from the queue, you get the entire queue, and the queue is reset to []

When data is harvested, send_data_to_endpoint (errors, /errors_data)

On the Rails Side

NewRelic::Agent.notice_error for example

init_plugin install_shim

include ControllerInstrumentations and ActionController

monkey patch process_action

#perform_action_with_newrelic_trace =

Transaction.start

Do the thing

Transaction.stop

throw stuff inside the ErrorTraceAgregator

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment