ChrisRackauckas/diffeq_vs_torchsde.md

## diffeq_vs_torchsde.md

      
### torchsde vs DifferentialEquations.jl / DiffEqFlux.jl (Julia)

This example is a 4-dimensional geometric Brownian motion. The code
for the torchsde version is taken directly from the
torchsde README
so that the comparison is fair to the author's own code.
The only change to that example is an explicit dt, so that
the simulation method and time step match between the two programs.
The SDE is solved 100 times. The results are summarized as follows:

- torchsde: 1.87 seconds
- DifferentialEquations.jl: 0.00115 seconds

This is roughly a 1,600x performance difference in favor of Julia on the Python library's own README example.
Further testing against torchsde could not be completed because of these performance issues.
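
As a quick check of the arithmetic behind that ratio, the snippet below recomputes it from the two reported totals and also breaks each total down per solve. The variable names are illustrative, and the only inputs are the rounded timings quoted above.

```python
# Reported wall-clock totals for 100 solves, in seconds (rounded values from the summary).
torchsde_total = 1.87
diffeq_total = 0.00115
n_solves = 100

# Ratio of the two totals.
speedup = torchsde_total / diffeq_total

# Average cost of a single solve in more readable units.
torchsde_per_solve_ms = torchsde_total / n_solves * 1e3
diffeq_per_solve_us = diffeq_total / n_solves * 1e6

print(f"speedup: {speedup:.0f}x")
print(f"torchsde per solve: {torchsde_per_solve_ms:.1f} ms")
print(f"DifferentialEquations.jl per solve: {diffeq_per_solve_us:.1f} us")
```

With these inputs the ratio comes out at about 1,626x, with roughly 18.7 ms per solve for torchsde and 11.5 microseconds per solve for DifferentialEquations.jl.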

#### Note about regularity

We note that the performance difference in the context of neural SDEs is likely
smaller, because more of the run time is spent in matrix multiplication kernels. However,
given that full SDE training examples like the one demonstrated here
generally take about a minute, we still expect a major performance difference,
but we currently do not have the compute time to run a full demonstration.
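
For readers who want to see what a single fixed-step solve of this problem involves, here is a minimal library-free sketch of the same SDE, dY = mu*Y dt + sigma*Y dW with diagonal noise, on the same 20-point grid (19 steps of size 1/19). It uses plain Euler-Maruyama purely for brevity, which is a simpler scheme than the SRIW1 method called in the Julia script and than whatever method torchsde selects. The function name `euler_maruyama_gbm` and its parameters are hypothetical and belong to neither library, and the sketch is not meant to be timed against them.

```python
import math
import random


def euler_maruyama_gbm(y0, mu, sigma, t0=0.0, t1=1.0, n_steps=19, rng=None):
    """Integrate dY = mu*Y dt + sigma*Y dW componentwise with a fixed step.

    Illustrative only: Euler-Maruyama is swapped in because it is short,
    not because either benchmarked library uses it here.
    """
    rng = rng or random.Random()
    dt = (t1 - t0) / n_steps
    sqrt_dt = math.sqrt(dt)
    y = list(y0)
    path = [list(y)]  # state at every grid point, including t0
    for _ in range(n_steps):
        # Diagonal noise: every component draws its own N(0, dt) increment.
        y = [yi + mu * yi * dt + sigma * yi * rng.gauss(0.0, sqrt_dt) for yi in y]
        path.append(list(y))
    return path


# Four components starting at 0.1, as in the scripts below (sigma as in the Julia one).
path = euler_maruyama_gbm([0.1] * 4, mu=0.5, sigma=0.1, rng=random.Random(0))
```

Setting `sigma=0.0` turns the loop into the explicit Euler method for the ODE dY = mu*Y dt, which is a convenient sanity check on the drift term.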

  
## stochasticdiffeq.jl
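using StochasticDiffEq

# Drift and diffusion coefficients, one per state component.
const μ = 0.5ones(4)
const σ = 0.1ones(4)

# In-place drift f and diagonal diffusion g: dU = μ.*U dt + σ.*U dW.
f(du,u,p,t) = du .= μ .* u
g(du,u,p,t) = du .= σ .* u

u0 = 0.1ones(4)
tspan = (0.0,1.0)
saveat = range(0.0,1.0,length=20)  # only used to pick dt = saveat[2] = 1/19
prob = SDEProblem(f,g,u0,tspan)

# Fixed-step SRIW1, 100 solves. Run the loop more than once:
# the first run includes compilation time.
@time for i in 1:100
  sol = solve(prob,SRIW1(),adaptive=false,dt=saveat[2])
end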

#0.001154 seconds (16.50 k allocations: 1.109 MiB)
#0.001162 seconds (16.50 k allocations: 1.109 MiB)
#0.001148 seconds (16.50 k allocations: 1.109 MiB)

## torchsde.py
import timeit

import torch
from torchsde import sdeint


class SDE(torch.nn.Module):

    def __init__(self, mu, sigma):
        super().__init__()
        self.noise_type="diagonal"
        self.sde_type = "ito"

        self.mu = mu
        self.sigma = sigma

    def f(self, t, y):
        return self.mu * y

    def g(self, t, y):
        return self.sigma * y

batch_size, d, m = 4, 1, 1  # State dimension d, Brownian motion dimension m.
geometric_bm = SDE(mu=0.5, sigma=1)
y0 = torch.zeros(batch_size, d).fill_(0.1)  # Initial state.
ts = torch.linspace(0, 1, 20)
ys = sdeint(geometric_bm, y0, ts)

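def time_func():
    # Fixed step dt = ts[1] = 1/19, the same step size as in the Julia script.
    ys = sdeint(geometric_bm, y0, ts, adaptive=False, dt=ts[1], options={'trapezoidal_approx': False})

# Total wall time for 100 solves, in seconds.
print(timeit.Timer(time_func).timeit(number=100))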

# 1.8681289999999535 seconds
# 1.8695188000001508 seconds