Abstract

Many times a scientist is choosing a programming language or a piece of software for a specific purpose. For the field of scientific computing, the methods for solving differential equations are one of the most important areas. What I would like to do is take the time to compare and contrast the most popular offerings. This is a good way to reflect upon what's available and find out where there is room for improvement. I hope that by giving you the details for how each suite was put together (and the "why", as gathered from software publications) you can come to your own conclusion as to which suites are right for you. (Full disclosure: I am the lead developer of DifferentialEquations.jl. You will see at the end that DifferentialEquations.jl does offer pretty much everything from the other suites combined, but that's no accident: our software organization came last and we used these suites as a guiding hand for how to design ours.)

Quick Summary Table

If you just want a quick summary, I created a table which has all of this information. You can find it here (click for PDF).

MATLAB's Built-In Methods

Due to its popularity, let's start with MATLAB's built-in differential equation solvers. MATLAB's differential equation solver suite was described in a research paper by its creator Lawrence Shampine, and this paper is one of the most highly cited SIAM Scientific Computing publications. Shampine also had a few other papers at this time developing the idea of "methods for a problem solving environment", or PSE. The idea is pretty simple: users of a problem solving environment (the examples from his papers are MATLAB and Maple) do not have the same requirements as more general users of scientific computing. Instead of focusing on efficiency, the key for this group is to have a clear and neatly defined (universal) interface which has a lot of flexibility.

The MATLAB ODE Suite does extremely well at hitting these goals. MATLAB documents its ODE solvers very well, there's a similar interface for using each of the different methods, and it tells you in a table in which cases you should use the different methods. But the modifications to the methods go even further. Let's take for example the classic ode45. That method just works and creates good plots, right? Well, Shampine added a little trick to it. When you solve an equation using ode45, the Runge-Kutta method uses a "free" interpolation to fill in some extra points. So between any two steps that the solver takes, it automatically adds in 4 extra points using a 4th order interpolation. This is because high order ODE solvers are good enough at achieving "standard user error tolerances" that they take quite large timesteps, and in doing so step too infrequently to make a good plot. Shampine's scheme is a good quick fix to this problem, which most people probably never knew was occurring under the hood!
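To make the "free interpolation" trick concrete, here is a rough sketch of the same idea in Python with SciPy (not MATLAB's actual internals; the test problem and the 4-points-per-step refinement are just illustrative assumptions): the solver keeps its own interpolant ("dense output"), and the extra plot points come from evaluating it rather than from taking more steps.

```python
import numpy as np
from scipy.integrate import solve_ivp

# A simple nonstiff test problem: harmonic oscillator y'' = -y written as a system.
def f(t, y):
    return [y[1], -y[0]]

# dense_output=True asks the solver to keep its interpolants ("dense output"),
# analogous to the interpolant ode45 gets for free from its RK stages.
sol = solve_ivp(f, (0.0, 10.0), [1.0, 0.0], method="RK45",
                rtol=1e-6, atol=1e-9, dense_output=True)

# The solver's own steps are sparse because the method takes large steps.
print("number of accepted steps:", len(sol.t) - 1)

# Mimic ode45's default refinement: insert 4 extra interpolated points
# between each pair of solver steps purely for plotting purposes.
refine = 4
t_plot = np.concatenate([
    np.linspace(t0, t1, refine + 1)[:-1]
    for t0, t1 in zip(sol.t[:-1], sol.t[1:])
] + [sol.t[-1:]])
y_plot = sol.sol(t_plot)  # evaluate the interpolant; no extra RHS calls needed

print("points available for plotting:", t_plot.size)
```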
There's quite a bit of flexibility here. The methods allow you to use complex numbers. You're given access to the "dense output function" (this is the function which computes the interpolations). There are a few options you can tweak. Every one of these methods is set up with event handling, and there are methods which can handle differential-algebraic equations. There are also dde23 and ddesd for delay differential equations, and in the financial toolbox there's an Euler-Maruyama method for SDEs.

While MATLAB does an excellent job at giving a large amount of easily available functionality, where it lacks is performance. There are a few reasons for this. For one thing, modifications like adding extra points to the solution array can really increase the amount of memory consumed if the ODE system is large! This actually has an impact in other ways. There's a very good example of this in ode45. ode45 is based on the Dormand-Prince 5(4) pair. However, in 1999, the same year the MATLAB ODE Suite was published, Shampine released a paper with a new 5(4) pair which was more efficient than the Dormand-Prince method. So that raises the question: why wasn't this used in the MATLAB ODE Suite (it's clear Shampine knew about it!)? (I actually asked him in an email.) The reason is that its interpolation requires calculating some extra steps, so it's less efficient if you are ALWAYS interpolating. But since ode45 is always interpolating in order to make the plots look nicer, this would get in the way. Essentially, it can be more efficient, but MATLAB sets things up for nice plotting and not pure efficiency.

But there are other areas where more efficient methods were passed up during the development phase of the ODE suite. For example, Hairer's benchmarks in his books Solving Ordinary Differential Equations I and II (the second is for stiff problems), along with the benchmarks from the Julia DifferentialEquations.jl suite, consistently show that high order Runge-Kutta methods are usually the most efficient methods for high accuracy solving of nonstiff ODEs. These benchmarks both consistently show that, for the same error, high order Runge-Kutta methods (like order >6) can solve the equation much faster than methods like Adams methods. But MATLAB does not offer high order Runge-Kutta methods and only offers ode113 (an Adams method) for high-accuracy solving. Some of this is due to a limitation within MATLAB itself. MATLAB's ODE solvers require taking in a user-defined function, and since this function is defined in MATLAB its function calls are very inefficient and expensive. Thus MATLAB's ODE solver suite can become more efficient by using methods which reduce the number of function calls (which multistep methods do).

But this isn't the only case where efficient methods are missing. Both Hairer's and the JuliaDiffEq benchmarks show that high-order Rosenbrock methods are the most efficient for low-to-medium accuracy stiff ODEs, but MATLAB doesn't offer these methods. It does offer ode23s, a low-order Rosenbrock method, and ode15s, which is a multistep method. ode15s is supposed to be the "meat and potatoes of the MATLAB Suite", but it cannot handle all equations since its higher order methods (it's adaptive order) are not L-stable (and not even A-stable). For this reason there are a few other low order SDIRK methods (ode23tb, an ESDIRK method for highly stiff problems) which are recommended to fill in the gaps, but none of the higher order variants which are known to be more efficient for many equations.
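To ground the stiff-solver discussion, here is a small sketch of the kind of comparison Hairer's books and the DifferentialEquations.jl benchmarks make, using SciPy's 'Radau' (a fully implicit RK method in the radau tradition) and 'BDF' (a multistep method in the ode15s/LSODE tradition) on Robertson's classic stiff problem. This is only a template, not a benchmark: which family wins depends on the problem, the tolerances, and how expensive the right-hand side is.

```python
import time
from scipy.integrate import solve_ivp

# Robertson's problem: a standard stiff test case (three reacting species).
def rober(t, y):
    y1, y2, y3 = y
    return [-0.04 * y1 + 1e4 * y2 * y3,
             0.04 * y1 - 1e4 * y2 * y3 - 3e7 * y2**2,
             3e7 * y2**2]

y0 = [1.0, 0.0, 0.0]
t_span = (0.0, 1e5)

# 'Radau' is a fully implicit Runge-Kutta method; 'BDF' is a variable-order
# multistep method. Both are stiff solvers, but they trade off work differently.
for method in ("Radau", "BDF"):
    start = time.perf_counter()
    sol = solve_ivp(rober, t_span, y0, method=method,
                    rtol=1e-6, atol=[1e-8, 1e-10, 1e-8])
    elapsed = time.perf_counter() - start
    print(f"{method:5s}: success={sol.success}, steps={len(sol.t)-1}, "
          f"nfev={sol.nfev}, njev={sol.njev}, time={elapsed:.3f}s")
```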
This pattern goes beyond the ODE solvers. The DDE solvers are all low order, and in the case of ddesd, it's a low accuracy method which is fast for getting plots correct but not something which converges to many decimal places all too well, since it doesn't explicitly track discontinuities. This is even seen in the paper on the method, which shows the convergence only matches dde23 to graphical accuracy on a constant-delay problem. Again, this fits in with the mantra of the suite, but may not hit all demographics. Shampine specifically made a separate version of ddesd for Fortran for people who are interested in efficiency, which is another way of noting that the key of ddesd is features and automatic usage, and not hardcore scientific computing efficiency. The mentioned SDE solver from the financial toolbox is only strong order 0.5 and thus requires quite a small dt to be accurate.

And I can keep going, but I think you get the moral of the story. This suite was created with one purpose in mind: to make it very easy to solve a wide array of differential equations and get a nice plot out. It does a very good job at doing so. But it wasn't made with efficiency in mind, and so it's missing a lot of methods that may be useful if you want high efficiency or high accuracy. Adding these wouldn't make sense for the MATLAB suite anyway since it would clutter the offering. MATLAB is about ease of use, and not efficiency, and it does extremely well at what it set out to do. For software that was first made before the Y2K crisis, with just a few methods added later, it still holds up very well.

What would make sense to add to MATLAB are some "addon routines" though: estimation of parameters in ODEs/DDEs, sensitivity analysis, etc. These are the kinds of things which users "can" do, but might not necessarily do "correctly" or in full (for example, for estimation of parameters you really want to use sensitivity analysis or autodifferentiation for the gradients, and in many cases you'll want to be using global or semi-global optimization methods; this is in contrast to the simple Levenberg-Marquardt approach I see suggested in online threads). AMIGO2 is a good MATLAB toolbox that it would make sense to add to the standard library since it adds these kinds of functionality, but MATLAB still gets a "None" in this category since I am only considering the built-in features here. But while I'm at it, MATLAB does have a good Runge-Kutta Nystrom implementation in the File Exchange and it really should be cleaned up and added to the standard library. I think this kind of focus on expanding features would be a good fit for MATLAB.
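As an illustration of what such an "addon routine" for parameter estimation involves, here is a minimal sketch in Python: one rate constant of a toy decay ODE is fit to synthetic data by nesting an ODE solve inside a local least-squares optimizer. This is exactly the naive approach warned about above; a serious tool would supply gradients via sensitivity analysis or automatic differentiation and offer global optimization. The model, data, and names are made up for illustration.

```python
import numpy as np
from scipy.integrate import solve_ivp
from scipy.optimize import least_squares

# Toy model: y' = -k * y, with unknown decay rate k to be estimated.
def model(t, y, k):
    return [-k * y[0]]

# Synthetic "measurements" generated from a known k (here 1.3) plus noise.
rng = np.random.default_rng(0)
t_data = np.linspace(0.0, 4.0, 15)
y_data = np.exp(-1.3 * t_data) + 0.01 * rng.standard_normal(t_data.size)

def residuals(params):
    (k,) = params
    sol = solve_ivp(model, (t_data[0], t_data[-1]), [1.0],
                    t_eval=t_data, args=(k,), rtol=1e-8, atol=1e-10)
    return sol.y[0] - y_data

# Local nonlinear least-squares fit; fine for this toy problem, but real
# parameter estimation often needs sensitivities and global search.
fit = least_squares(residuals, x0=[0.5])
print("estimated k:", fit.x[0])
```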
Hairer's Solvers (Fortran)

Next I want to bring up some Fortran solvers because they will come up later. Hairer's Fortran solvers are a set of methods with similar interfaces that were designed with efficiency in mind. Many of these methods are classics: dopri5, dop853, radau, and rodas will specifically show up in many of the suites which are discussed later. These methods are not too flexible: they don't allow event handling (though with enough gusto you can use the dense output to write your own), or numbers that aren't double-precision floating point numbers (it's Fortran). They have a good set of options for tweaking parameters to make the adaptive timestepping more efficient, though you may have to read a few textbooks to know exactly what they do. And that's the key to them: they will solve an ODE, stiff or non-stiff, and they will do so pretty efficiently, but nothing more.

But even then, they show some age which keeps them from being "perfectly efficient". These solvers include their own linear algebra routines which don't multithread like standard BLAS and LAPACK implementations, meaning that they won't make full use of modern CPU architectures. The computations don't necessarily make use of SIMD or FMA. But most of all, to use them directly you need to use Fortran, which will be a turn-off for many people.

There is some flexibility in the offering. There are symplectic solvers for second order ODEs, the stiff solvers allow for solving DAEs in mass matrix form, there's a constant-lag nonstiff delay differential equation solver (RETARD), there is a fantastic generalization of radau to stiff state-dependent delay differential equations (RADAR5), and there are some solvers specifically for the "mechanical ODEs" commonly found in physical problems. Of course, to get this all working you'll need to be pretty familiar with Fortran, but this is a good suite to look at if you are.

ODEPACK and Netlib ODE Solvers (Fortran)

ODEPACK is an old set of ODE solvers which accumulated over time. You can learn a bit about its history by reading this interview with Alan Hindmarsh. I also bundle the Netlib suite together with it. There's a wide variety of methods in there. There are old Runge-Kutta methods due to Shampine, some of Shampine's old multistep methods ddebdf and ddeabm, etc. The reason why this pack is really important, though, is because of the Lawrence Livermore set of methods, specifically LSODE and its related methods (LSODA, LSODR, VODE, etc.). It includes methods for implicit ODEs (DAEs) as well. These are a group of multistep methods which are descended from GEAR, the original BDF code. They are pretty low level and thus allow you to control the solver step by step, and some of them have "rootfinding capabilities". This means that you can use these to add event handling, though an event handling interface will take some coding. There are lots of options and these are generally pretty performant for a large array of problems, but they do show their age in the same way that the Hairer codes do. Their linear algebra setups do not make use of modern BLAS and LAPACK, which in practical terms means they don't make full use of modern computer architectures to speed things up. They don't always make use of SIMD and other modern CPU acceleration features. They are the vanilla ODE solvers which have existed through time. In particular, LSODA is popular because it is the most widely distributed method which automatically detects stiffness and switches between integration methods, though it should be pointed out that there is a performance penalty for this feature.

While its ODE solvers are not particularly interesting anymore (pretty much every ODE solver since was created to improve upon the designs here), there are many non-standard pieces that you still cannot find elsewhere. For example, COLDAE is probably still the best BVP solver for DAEs. It has a multiple shooting BVP solver as well which allows you to give it integration schemes (MUS). It has Runge-Kutta Chebyshev (RKC) methods for first and second order ODEs, which are highly stable methods for semi-stiff equations (typically noted as good for parabolic PDEs). It also has an implicit RKC method. It has a MIRK-based adaptive BVP solver, similar to MATLAB's bvp4c but with a Fortran implementation. And the implicit ODE (DAE) solvers DASSL, DASKR, and DASPK were the precursors to Sundials' IDA and have different characteristics which can be good or bad depending on the problem. Altogether, there's some interesting stuff here, but the interfaces are not too uniform.
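Since LSODA is the piece of ODEPACK most people actually encounter, here is a quick sketch of its automatic stiffness switching through SciPy's wrapper (solve_ivp's 'LSODA' method calls the same ODEPACK code), run on the Van der Pol oscillator, which goes from nonstiff to very stiff as its parameter grows. The parameter values and tolerances are just illustrative.

```python
from scipy.integrate import solve_ivp

# Van der Pol oscillator: mildly nonstiff for small mu, increasingly stiff as mu grows.
def vdp(t, y, mu):
    return [y[1], mu * (1.0 - y[0]**2) * y[1] - y[0]]

for mu in (1.0, 1000.0):
    # method='LSODA' calls the classic ODEPACK code, which monitors the problem
    # and switches between Adams (nonstiff) and BDF (stiff) integration on its own.
    sol = solve_ivp(vdp, (0.0, 3000.0), [2.0, 0.0], method="LSODA",
                    args=(mu,), rtol=1e-6, atol=1e-9)
    print(f"mu={mu:7.1f}: success={sol.success}, steps={len(sol.t)-1}, nfev={sol.nfev}")
```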
NAG ODE Solvers (Fortran)

The NAG ODE solvers are another classic set of Fortran solvers. If you take a look through them, you'll notice that they not only kept ODEPACK and Netlib in mind... they are very similar. It has a few Runge-Kutta methods (directly derived from RKSUITE), and a multistep solver with Adams and BDF versions. Like the other Fortran suites, it has "event handling" in the sense of "you can control it step-by-step and you have a rootfinder, so you can write your own event handling". They do hold your hand a little bit more by providing driver functions which do some cool stuff, but this is still definitely the old-school way of having a feature. It also has a good set of BVP solvers covering a wide range of algorithms, similar to ODEPACK and Netlib. While this is one suite I haven't used myself, I am sure that it's efficient like the other Fortran methods, but it hasn't been updated with implicit Runge-Kutta or Rosenbrock methods which are known to be more efficient in many cases (in their modern form). Thus for the chart I am just going to clump them with ODEPACK and Netlib and think of them as an additional set of methods to add to that group.

Sundials and ARKCODE (C++ and Fortran)

Sundials' CVODE is a rewrite of VODE in C; VODE is a descendant of LSODE, which itself descends from the original GEAR multistep code. Yes, this has a long history. But you should think of it as "LSODE upgraded": it makes use of modern BLAS/LAPACK, and also a bunch of other efficient C/Fortran linear solvers, to give very efficient Adams and BDF methods. Its solver IDA is like CVODE but handles implicit ODEs (DAEs). The interface for these is very similar to the ODEPACK interface, which means you can control them step by step and use the rootfinding capabilities to write your own event handling interface. Since the Adams methods handle nonstiff ODEs and the BDF methods handle stiff ODEs, this performance plus flexibility makes it the "one-stop shop" for ODE solving. Many different scientific computing packages internally make use of Sundials because it can handle just about anything.

Well, it does have some limitations. For one, it only works with standard C++ numbers and arrays. There's no stochasticity or delays allowed. But more importantly, Hairer's and DifferentialEquations.jl's benchmarks show that these multistep methods are usually not the most efficient methods, since high order Rosenbrock, (E)SDIRK, and fully implicit RK methods are usually faster for the same accuracy on stiff problems, and high order Runge-Kutta methods are faster than the Adams methods for the same accuracy. Multistep methods are also not very stable at their higher orders, and so at higher tolerances (lower accuracy) these methods may fail to have their steps converge on standard test problems (see this note in the ROBER tests), meaning that you may have to increase the accuracy (and thus computational cost) due to stiffness issues. But, since multistep methods only require a single function evaluation per step (or are implicit in only single steps), they are the most efficient for asymptotically hard problems (i.e. when the derivative calculation is very expensive or the ODE is a large 10,000+ system). For this reason, these methods excel at solving large discretizations of PDEs. To top it off, there are parallel (MPI) versions of CVODE/IDA for HPC applications.
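The "control it step by step and use the rootfinding to build your own event handling" pattern that keeps coming up (ODEPACK, NAG, Sundials) looks roughly like the sketch below. This uses SciPy's RK45 stepper and brentq purely as stand-ins for a low-level integrator and a root finder, not any of those libraries' actual interfaces: take a step, check the event function for a sign change, and if one occurred, root-find on the step's interpolant to locate the crossing. (A real implementation also has to worry about multiple crossings within a single step, which this sketch ignores.)

```python
from scipy.integrate import RK45
from scipy.optimize import brentq

# Falling object: stop when height y[0] crosses zero.
def f(t, y):
    return [y[1], -9.81]

def event(t, y):
    return y[0]  # a sign change marks the event

solver = RK45(f, 0.0, [10.0, 0.0], t_bound=10.0, rtol=1e-8, atol=1e-10)
g_old = event(solver.t, solver.y)

while solver.status == "running":
    solver.step()                      # advance one adaptive step
    g_new = event(solver.t, solver.y)
    if g_old * g_new < 0.0:            # sign change: event inside the last step
        dense = solver.dense_output()  # interpolant over [t_old, t]
        t_event = brentq(lambda t: event(t, dense(t)), solver.t_old, solver.t)
        print(f"event located at t = {t_event:.6f}, y = {dense(t_event)}")
        break
    g_old = g_new
```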
I also want to note that recently Sundials added ARKCODE, a set of Runge-Kutta methods. These include explicit Runge-Kutta methods and SDIRK methods, including additive Runge-Kutta methods for IMEX integration (i.e. you can split out a portion to solve explicitly so that the implicit portion is cheaper when you know your problem is only partly or semi stiff). This means that it covers the methods which I mentioned earlier are more efficient in many of the cases (though it is a bit lacking on the explicit tableaus and thus could be more efficient, but that's just details). However, benchmarks show that ARKODE is not implemented with the same quality as CVODE, with its default settings usually diverging on stiff problems and tweaked parameters not performing well in benchmarks (see the edits 4/21/2018 for details). Thus it should be considered separately from the rest of the "Sundials brand", giving a form of completeness but not with the same level of performance as you would expect.

If you are using C++ or Fortran and want to write to only one interface, the Sundials suite is a great Swiss Army knife. And if you have an asymptotically large problem or very expensive function evaluations, this will be the most efficient as well. Plus the parallel versions are cool! You do have to live with the limitations of low-level software forcing specific number and array types, along with the fact that you need to write your own event handling, but if you're "hardcore" and writing in a compiled language this suite is a good bet.

SciPy's Solvers (Python)

Now we come to SciPy's suite. SciPy 1.0 includes some of its own Runge-Kutta methods, with tableaus for Dormand-Prince 4/5 and Bogacki-Shampine 2/3. However, its basic Runge-Kutta integrator is written directly in Python with loops, it utilizes an old-school timestepping control instead of newer, more efficient ones (this makes it less stable and more likely to diverge than most implementations), it doesn't have very many options, etc. So this is a definite step backwards in terms of "hardcore efficiency" and the features for optimizing an RK method to a given problem. However, it is a big step forward because it allows the methods to solve on matrices and tensors instead of just vectors, allows a few of the methods to have dense output, and adds more flexibility. These same changes were made to its BDF method as well. The developer chats seem to say it's around a 10x performance loss for problems with around 200 ODEs, with smaller problems getting more overhead and larger ones getting less (due to the use of vectorization when possible).

Using a Python interface with classes and the like, it does offer a step-by-step interface. It now has some basic event handling, but it only checks the interval endpoints, so if your event function isn't monotonic over a step it can easily miss a crossing, and there are other problems like floating point issues which come from not doing a pullback. It's still at the point where it's a decent undergraduate homework problem to ask students to find a way to break it (there are several quick ways). This isn't just theoretical: users have posted more than one issue with easy ways to break its event handling, so even for simple cases use caution.
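For reference, SciPy's built-in event interface looks like the following (the same falling-object toy problem as in the hand-rolled sketch earlier). The caveats above still apply: detection is based on sign changes between step endpoints, so non-monotonic event functions can slip through.

```python
from scipy.integrate import solve_ivp

# Falling object: terminate the integration when the height hits zero.
def f(t, y):
    return [y[1], -9.81]

def hit_ground(t, y):
    return y[0]              # solve_ivp watches this for sign changes between steps

hit_ground.terminal = True   # stop the integration at the event
hit_ground.direction = -1    # only trigger on positive-to-negative crossings

sol = solve_ivp(f, (0.0, 10.0), [10.0, 0.0], events=hit_ground,
                rtol=1e-8, atol=1e-10)
print("event times:", sol.t_events[0])    # times where hit_ground crossed zero
print("state at event:", sol.y_events[0])
```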
It still offers wrappers to LSODA and radau (ODEPACK's and Hairer's methods, respectively). It has dop853 and dopri5 hidden in an old interface; in the developers' tests these are more efficient, but they don't have the nice extra features like post-solution interpolation. It doesn't have a big options list, so if you were planning on swapping out the linear solver for PETSc you're out of luck, but it does expose some of the internal options of LSODA for banded Jacobians. It only has ODE solvers, no differential-algebraic, delay, or stochastic solvers. It does include a single BVP solver though. The tableaus are all stored at double precision, so even if higher precision numbers were accepted it wouldn't give a higher precision result. As with the methods analysis, its only higher order Runge-Kutta method for efficient solving of nonstiff equations is dop853, which is now buried in the legacy interface without the extra features, and it's missing Rosenbrock and SDIRK methods entirely, opting to only provide the multistep methods. I should note here that it has the same limitation as MATLAB, namely that the user's function is Python code. Since the derivative function is where the ODE solver spends most of its time (for sufficiently difficult problems), this means that even though you are calling Fortran code, you will lose out on a lot of efficiency.

Still, if efficiency isn't a big deal and you don't need bells and whistles, this suite will do the basics. While before it was far behind MATLAB, SciPy 1.0's big update put it only mildly behind MATLAB. It handles a much smaller domain and is missing a bunch of methods, but if your problem is simple enough it'll do it, and it can handle some basic events as well. It's nice to teach a class with and make some plots, and then you can go grab a Sundials wrapper when you need it.

deSolve Package (R)

There's not much to say other than that deSolve is very much like SciPy's suite. It wraps almost the same solvers, has pretty much the same limitations, and has the same efficiency problem since in this case it's calling the user-provided R function most of the time. One advantage is that it does have event handling. Less vanilla, with a few more features, but generally the same as SciPy.

I do want to make a note about its delay differential equation solver, dede. The documentation specifically says:

"dede does not deal explicitly with propagated derivative discontinuities, but relies on the integrator to control the stepsize in the region of a discontinuity. dede does not include methods to deal with delays that are smaller than the stepsize, although in some cases it may be possible to solve such models. For these reasons, it can only solve rather simple delay differential equations."

As a methods researcher, I am horrified by this. Here are a few things wrong with it:

- The error estimator is only at the end of the interval, meaning it's a very bad check for discontinuities. This is why ddesd used a special residual. None of their linked methods do this.
- Not propagating discontinuities means the order drops to 1 (see the small worked example at the end of this section).
- They allow using a multistep method over a discontinuity. This might converge to order 1? There's no theory for this.
- When delays are smaller than the step size, the method is actually implicit, even if the ODE solver it's using is explicit. What this means is that this algorithm is essentially a zero-iteration Picard solver for the fixed point problem. This is probably the worst part.
- They end by saying it's only for "simple" delay equations without mentioning these as the reasons.

I think that's going too far: there is no good guarantee that the error is low or that the method converges well. I will mark it as "Poor" in the table, but I would advise against using this method (and would recommend that the developers remove this from the docs, because this is definitely an easy place for users to be misled by a numerical result).
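To make the discontinuity-propagation point concrete (the worked example referenced in the list above): for the constant-delay problem y'(t) = -y(t-1) with history y = 1 for t <= 0, the method of steps gives y(t) = 1 - t on [0,1] and y(t) = t^2/2 - 2t + 3/2 on [1,2], so the second derivative jumps from 0 to 1 at t = 1. An integrator that does not place that breakpoint on its mesh takes an accuracy hit there. A tiny check of this, using the exact piecewise solution:

```python
import numpy as np

# Method-of-steps exact solution of y'(t) = -y(t-1), with y(t) = 1 for t <= 0.
def y_exact(t):
    t = np.asarray(t, dtype=float)
    piece0 = 1.0 - t                     # valid on [0, 1]
    piece1 = 0.5 * t**2 - 2.0 * t + 1.5  # valid on [1, 2]
    return np.where(t <= 1.0, piece0, piece1)

# Second derivative on each side of t = 1 via central differences inside each piece.
h = 1e-4
def second_derivative(t):
    return (y_exact(t + h) - 2.0 * y_exact(t) + y_exact(t - h)) / h**2

print("y'' just left of t=1 :", second_derivative(1.0 - 10 * h))   # ~ 0
print("y'' just right of t=1:", second_derivative(1.0 + 10 * h))   # ~ 1
```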
PyDSTool (Python)

PyDSTool is an odd little beast. Part of the software is for analytic continuation (i.e. bifurcation plotting). But another part of it is for ODE solvers. It contains one ODE solver which is written in Python itself, and it recommends against actually using this for efficiency reasons. Instead, it wraps some of the Hairer methods, specifically dopri5 and radau, and recommends these. But it's different from SciPy in that it takes the specification of the ODE as a string, compiles it to a C function, and uses this inside the ODE solver. By doing so, it's much more efficient. We still note that its array of available methods is small. It offers radau, which is great for high accuracy ODEs and DAEs but not the most efficient at lower accuracy, so it would've been nice to see Rosenbrock and ESDIRK methods. It has some basic event handling and methods for DDEs (again as wrappers to a Hairer method). This is a pretty good suite if you can get it working, though I do caution that getting the extra (non-Python) methods set up and compiled is nontrivial. One big point to note is that I find the documentation spectacularly difficult to parse. Altogether, it's pretty efficient and has a good set of methods which will do basic event handling and solve problems at double precision.

JiTCODE and JiTCSDE (Python)

JiTCODE is another Python library which makes things efficient by compiling the function that the user provides. It uses the SciPy integrators and does something similar to PyDSTool in order to get efficiency. I haven't tried it out myself, but I'll assume this will get you as efficient as if you were calling the Fortran directly. However, it is lacking in the features department, not offering advanced things like arbitrary number types, event handling, etc. But if you have a vanilla ODE to solve and you want to easily do it efficiently in Python, this is a good option to look at.

Additionally, JiTCDDE is a version for constant-lag DDEs similar to dde23, and JiTCSDE is a version for stochastic differential equations. JiTCSDE uses the high order (strong order 1.5) adaptive Runge-Kutta method for diagonal noise SDEs developed by Rackauckas (that's me) and Nie, which has been demonstrated to be much more efficient than the low order and fixed timestep methods found in the other offerings. It employs the same compilation setup as JiTCODE, so it should create efficient code as well. I haven't used these myself, but they would probably be very efficient ODE/DDE/SDE solvers if you want to use Python and don't need events and other sugar.
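The "compile the user's right-hand side" idea behind PyDSTool and JiTCODE can be illustrated without either library's actual API. The sketch below gets the same effect in plain SciPy by JIT-compiling the derivative function with numba (assumed to be installed), so that the solver is not calling back into interpreted Python on every evaluation.

```python
import numpy as np
from numba import njit
from scipy.integrate import solve_ivp

# Lorenz system; the RHS is where the solver spends most of its time on hard problems.
@njit
def lorenz(t, y):
    sigma, rho, beta = 10.0, 28.0, 8.0 / 3.0
    return np.array([sigma * (y[1] - y[0]),
                     y[0] * (rho - y[2]) - y[1],
                     y[0] * y[1] - beta * y[2]])

# The first call triggers compilation; subsequent calls run as machine code,
# which is the same effect PyDSTool/JiTCODE get by generating and compiling C.
sol = solve_ivp(lorenz, (0.0, 100.0), [1.0, 0.0, 0.0],
                method="DOP853", rtol=1e-9, atol=1e-9)
print("steps taken:", len(sol.t) - 1)
```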
Boost ODEINT Solver Library (C++)

The Boost ODEINT solver library has some efficient implementations of some basic explicit Runge-Kutta methods (including high order RK methods) and some basic Rosenbrock methods (including a high order one). Thus it can be pretty efficient for solving the most standard stiff and nonstiff ODEs. However, its implementations do not make use of advanced timestepping techniques (PI controllers) with its explicit RK methods, which makes it require more steps to achieve the same accuracy as some of the more advanced software, making it not really any more efficient in practice (though it does do Gustafsson acceleration in its Rosenbrock scheme to prevent "the Hump" behavior). It doesn't have event handling, but it is flexible with the number and array types you can put in there via C++ templates. It and DifferentialEquations.jl are the only two suites mentioned here that allow for solving the differential equations on the GPU. Thus if you are familiar with templates and really want to make use of them, this might be the library to look at; otherwise you're probably better off looking elsewhere, like Sundials.

GSL ODE Solvers (C)

To say it lightly, the GSL ODE solvers are kind of a tragedy. Actually, they are just kind of weird. When comparing their choices against what is known to be efficient according to the ODE research and benchmarks, the methods that they chose to implement are pretty bizarre, like extrapolation methods which have repeatedly been shown to not be very efficient, while other more important methods are not included. But they do have some of the basics, like multistep Adams and BDF methods. Like Boost, this library doesn't do all of the fancy PI-controlled adaptivity and so on, so YMMV. This benchmark, while not measuring runtime and only counting function evaluations (which can be very, very different according to more sophisticated benchmarks like the Hairer and DifferentialEquations.jl ones!), clearly shows that the GSL solvers can take way too many function evaluations because of this, and thus, since they use methods similar to LSODA/LSODE/dopri5, they probably have much higher runtimes than they should.

Mathematica's ODE Solvers

Mathematica's ODE solvers are very sophisticated. It has a lot of explicit Runge-Kutta methods, including newer high order efficient methods due to Verner and Shampine's efficient method mentioned in the MATLAB section. These methods can be almost 5x faster than the older high order explicit RK methods, which themselves are the most efficient class of methods for many nonstiff ODEs, and thus these do quite well. It includes interpolations to make their solutions continuous functions that plot nicely. Its native methods are able to make full use of Mathematica and its arbitrary precision, but sadly most of the methods it uses are wrappers for the classic solvers. Its stiff solvers mainly call into Sundials or LSODA. By using LSODA, it tends to be able to do automatic stiffness detection by default. It also wraps Sundials' IDA for DAEs. It uses a method of steps implementation over its explicit Runge-Kutta methods for solving nonstiff DDEs efficiently, and includes high ord
