Skip to content

Instantly share code, notes, and snippets.

perf stat - 10 iterations
OMP_NUM_THREADS=1
-----------------
Performance counter stats for 'mvn exec:java -Dexec.mainClass=org.deeplearning4j.examples.recurrent.character.GravesLSTMCharModellingExample':
19368.054321 task-clock (msec) # 1.727 CPUs utilized
6,844 context-switches # 0.353 K/sec
130 cpu-migrations # 0.007 K/sec
426,191 page-faults # 0.022 M/sec
69,717,328,193 cycles # 3.600 GHz (83.40%)
00:00.0 Host bridge: Intel Corporation Xeon E5/Core i7 DMI2 (rev 07)
Subsystem: Intel Corporation Device 4953
Flags: fast devsel
Capabilities: [90] Express Root Port (Slot-), MSI 00
Capabilities: [e0] Power Management version 3
Capabilities: [100] Vendor Specific Information: ID=0002 Rev=0 Len=00c <?>
Capabilities: [144] Vendor Specific Information: ID=0004 Rev=1 Len=03c <?>
Capabilities: [1d0] Vendor Specific Information: ID=0003 Rev=1 Len=00a <?>
Capabilities: [280] Vendor Specific Information: ID=0004 Rev=2 Len=018 <?>
@ianpjohnson
ianpjohnson / gist:2cc85d75d79b798dd5fcbee414b50895
Created May 21, 2016 21:13
Core i7 gist - is it real or a VM ?
lspci
00:00.0 Host bridge: Intel Corporation Xeon E5/Core i7 DMI2 (rev 07)
00:01.0 PCI bridge: Intel Corporation Xeon E5/Core i7 IIO PCI Express Root Port 1a (rev 07)
00:01.1 PCI bridge: Intel Corporation Xeon E5/Core i7 IIO PCI Express Root Port 1b (rev 07)
00:02.0 PCI bridge: Intel Corporation Xeon E5/Core i7 IIO PCI Express Root Port 2a (rev 07)
00:03.0 PCI bridge: Intel Corporation Xeon E5/Core i7 IIO PCI Express Root Port 3a in PCI Express Mode (rev 07)
00:05.0 System peripheral: Intel Corporation Xeon E5/Core i7 Address Map, VTd_Misc, System Management (rev 07)
00:05.2 System peripheral: Intel Corporation Xeon E5/Core i7 Control Status and Global Errors (rev 07)
00:05.4 PIC: Intel Corporation Xeon E5/Core i7 I/O APIC (rev 07)
00:11.0 PCI bridge: Intel Corporation C600/X79 series chipset PCI Express Virtual Root Port (rev 05)
@ianpjohnson
ianpjohnson / gist:1378e744e8e9dd57b82324500ffed1ff
Created May 21, 2016 20:13
Performance problems on Ubuntu 15.10 - libnd4j 3.10 snapshot
Hi - Am recompiling master (pulled last night) from sources and am having a problem on my 6 Core i7 with performance on Ubuntu
Any clues what i might be doing wrong ?
Testing org.deeplearning4j.examples.recurrent.character.GravesLSTMCharModellingExample
My Core i5 2500K Windows 8 (64) on Java 1.8 (also using OpenBLAS) is about 8-10 times FASTER than the Core i7 on Ubuntu 15.10
I have downloaded and rebuilt OpenBLAS, and followed instructions from git on libnd4j etc