@aokomoriuta
aokomoriuta / test.cpp
Last active August 26, 2015 03:03
This comes out fairly clean in C++14
#include <utility>
#include <boost/fusion/include/transform.hpp>
#include <boost/fusion/include/make_vector.hpp>
#include <boost/fusion/include/invoke.hpp>
class S
{
private:
int a;
@aokomoriuta
aokomoriuta / apu.txt
Last active August 29, 2015 14:01
Profile of my computer with an APU
===================================================
GPU Caps Viewer v1.20.1.1
http://www.ozone3d.net/gpu_caps_viewer/
===================================================
===================================[ System / CPU ]
- CPU Name: AMD A10-7850K Radeon R7, 12 Compute Cores 4C+8G
- CPU Core Speed: 3718 MHz
- CPU logical cores: 4
@aokomoriuta
aokomoriuta / table.csv
Created May 21, 2015 01:00
V&V plot.py issue
#Dir,mx,my,mz,deltaT,endTime,method,nx,ny,nz,simulationType,LESModel,delta,solver,preconditioner,Build,Date,Time,nProcs,nCells,Steps,ExectutionTime0(s),ExectutionTime1(s),ExectutionTime/Steps(s)
n_00008/mpi_00001/simulationType_laminar-LESModel_laminar-delta_cubeRootVol-solver_GAMG-smoother_DIC,240,130,96,0.002000,0.022000,scotch,-,-,-,laminar,laminar,cubeRootVol,GAMG,DIC,,,,,,0,0,0,-0
n_00008/mpi_00001/simulationType_laminar-LESModel_laminar-delta_cubeRootVol-solver_GAMG-smoother_DIC,240,130,96,0.002000,0.022000,scotch,-,-,-,laminar,laminar,cubeRootVol,GAMG,DIC,,,,,,0,0,0,-0
n_00008/mpi_00001/simulationType_laminar-LESModel_laminar-delta_cubeRootVol-solver_GAMG-smoother_DIC,240,130,96,0.002000,0.022000,scotch,-,-,-,laminar,laminar,cubeRootVol,GAMG,DIC,,,,,,0,0,0,-0
n_00008/mpi_00001/simulationType_laminar-LESModel_laminar-delta_cubeRootVol-solver_GAMG-smoother_DIC,240,130,96,0.002000,0.022000,scotch,-,-,-,laminar,laminar,cubeRootVol,GAMG,DIC,dev-f6be0d888eec,May 21 2015,08:48:50,1,2995200,0,0,0,-0
n_00008/mp
@aokomoriuta
aokomoriuta / log.pimpleFoam.677
Created May 21, 2015 05:16
OpenCAE V&V issue#1 log*
/*---------------------------------------------------------------------------*\
| =========                 |                                                 |
| \\      /  F ield         | OpenFOAM: The Open Source CFD Toolbox           |
|  \\    /   O peration     | Version:  dev                                   |
|   \\  /    A nd           | Web:      www.OpenFOAM.org                      |
|    \\/     M anipulation  |                                                 |
\*---------------------------------------------------------------------------*/
Build : dev-f6be0d888eec
Exec : pimpleFoam
Date : May 21 2015
@aokomoriuta
aokomoriuta / build.log
Created May 22, 2015 05:43
RapidCFD build.log
make: Nothing to be done for 'all'.
========================================
Start ThirdParty Allwmake
========================================
========================================
Build MPI libraries if required
have OPENMPI shared library (openmpi-1.8.4)
#define _SCL_SECURE_NO_WARNINGS
#define __CL_ENABLE_EXCEPTIONS
#ifdef _MSC_VER
#pragma warning(push, 1)
#pragma warning(disable: 4996)
#endif
#include <iostream>
#include <CL/cl.hpp>
#ifdef _MSC_VER
@aokomoriuta
aokomoriuta / 倍精度.txt
Created August 21, 2012 02:25
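How to speed up vector addition with OpenCL, part 1
Double-precision results for https://github.com/aokomoriuta/StudiesOfOpenCLWithCloo/tree/master/VectorAddition/HeavyWorkItem
Same as single precision (changing the amount of work per work item does not make it any faster).
= Vector addition test =
Varying the amount of work per work item
Platform: NVIDIA CUDA (OpenCL 1.1 CUDA 4.1.1)
Number of devices: 2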
* GeForce GTX 295 (NVIDIA Corporation)
* GeForce GTX 295 (NVIDIA Corporation)
@aokomoriuta
aokomoriuta / 倍精度.txt
Created August 30, 2012 02:06
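How to speed up vector addition with OpenCL, part 2
Double-precision version of https://github.com/aokomoriuta/StudiesOfOpenCLWithCloo/tree/master/VectorAddition/MultiGpu
In double precision it is even faster (x6 compared with a single CPU).
My first thought was "wait, isn't double-precision arithmetic supposed to be slower?", but this is probably because the computation is memory-bound.
= Vector addition test =
Using multiple GPUs
Platform: NVIDIA CUDA (OpenCL 1.1 CUDA 4.1.1)
Number of devices: 2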
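The "probably memory-bound" remark above can be made concrete with a rough estimate. The sketch below is not taken from the gist: the helper names and the bandwidth figure passed in are hypothetical, and it assumes the kernel does nothing but c[i] = a[i] + b[i]. It counts the bytes moved per element and shows that, under this assumption, run time scales with the amount of data rather than with floating-point throughput.

```cpp
#include <cassert>
#include <cstddef>

// Bytes moved through memory per element of c[i] = a[i] + b[i]:
// two loads (a[i], b[i]) and one store (c[i]).
template <typename T>
constexpr std::size_t bytes_per_element() {
    return 3 * sizeof(T);
}

// Arithmetic intensity: one addition per element, divided by the bytes moved.
template <typename T>
constexpr double flops_per_byte() {
    return 1.0 / static_cast<double>(bytes_per_element<T>());
}

// Lower bound on run time if the kernel is limited purely by memory bandwidth.
// bandwidth_bytes_per_s is a hypothetical, user-supplied figure, not a measurement.
template <typename T>
double bandwidth_bound_seconds(std::size_t n, double bandwidth_bytes_per_s) {
    assert(bandwidth_bytes_per_s > 0.0);
    return static_cast<double>(n * bytes_per_element<T>()) / bandwidth_bytes_per_s;
}

// Compile-time sanity checks of the byte counts.
static_assert(bytes_per_element<float>() == 3 * sizeof(float), "two loads + one store");
static_assert(bytes_per_element<double>() == 3 * sizeof(double), "two loads + one store");
```

With a 4-byte float and an 8-byte double this gives 12 and 24 bytes per addition, so only a small fraction of a floating-point operation per byte in either precision. In that regime the slower double-precision units are unlikely to be the limiting factor, which is consistent with the guess in the description, though it does not by itself explain the measured x6.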
@aokomoriuta
aokomoriuta / 倍精度.txt
Created August 30, 2012 03:16
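How to speed up vector addition with OpenCL, part 3
Double-precision results for https://github.com/aokomoriuta/StudiesOfOpenCLWithCloo/tree/master/VectorAddition/UseHostPointer
The trend is the same as in single precision.
However, the speedup is again higher in double precision.
= Vector addition test =
Comparison with and without a host pointer
Platform: NVIDIA CUDA (OpenCL 1.1 CUDA 4.1.1)
Number of devices: 2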
@aokomoriuta
aokomoriuta / Length2.cpp
Created December 6, 2012 08:14
How to run a transform with three or more arguments in thrust ref: http://qiita.com/items/5f595c25c22d160fa0d0
#include <iostream>
// 2D vector, CPU version
void Length2()
{
// Number of elements
const int N = 5;
// x and y components
double x[N] = {0, 1, 2, 3, 4};