I got GPT-4 Code Interpreter to write most of this for me, transcript here
I had to modify it a tiny bit to get it to run, the parameters for the queries were missing.
Results on my M2 MacBook Pro:
---------------
Mode | Scenario | Update QPS | Select QPS
---------------
Normal | Combined | 1843.3 | 29.6
---------------
Normal | Update Only | 2300.3 | 0.0
---------------
Normal | Select Only | 0.0 | 205874.9
---------------
WAL | Combined | 11641.0 | 462251.8
---------------
WAL | Update Only | 9229.7 | 0.0
---------------
WAL | Select Only | 0.0 | 356455.4
---------------
I don't think this is an accurate benchmark of concurrent access though, because it's using Python threads which are subject to the GIL. So treat these numbers with suspicion.
They do seem to very much demonstrate the power of WAL mode though.
It's weird that the number of selects in WAL combined mode beats select only in that mode too. I got the same result a few times, and I don't understand why.