natowi/meshroom-node-details

## meshroom-node-details
How the nodes work (reconstructed, personal project)
CC BY-SA

## CameraInit

reads image metadata
sensor width lookup

Calculates viewpoints.sfm with image metadata and Intrinsics

Intrinsics Kmatrix "f;0;ppx;0;f;ppy;0;0;1"

Create cameraInit.sfm

calculates distortionParams


Although cameraInit.sfm uses data from viewpoints.sfm, those parts are slightly modified


the sfm files are json files

FoV https://www.scantips.com/lights/fieldofviewmath.html
https://www.pointsinfocus.com/tools/depth-of-field-and-equivalent-lens-calculator/#{%22c%22:[{%22f%22:13,%22av%22:%228%22,%22fl%22:50,%22d%22:3048,%22cm%22:%220%22}],%22m%22:0}


CameraInit.sfm/viewpoints.sfm file structure:
(viewpoints.sfm is the file generated when importing images to meshroom. CameraInit.sfm is the output that is being used in the next node)
----
version: [Array]
views: [Array] _the different views (images) with unique ids and metadata_
intrinsics: [Array] _the camera intrinsics, views (images) with the same intrinsics share an id. A different zoom level or camera has a new id. Includes principalPoint and distortionParams_
----


## Feature Extraction (SIFT)

desc are binary files
feat are ascii ones

view_id.sift.feat -> table with the extracted *features*
view_id.sift.desc -> descriptors

Reference: http://www.vlfeat.org/overview/sift.html
https://dsp.stackexchange.com/questions/24346/difference-between-feature-detector-and-descriptor
---
The image origin (top-left corner) has coordinate (0,0)
The lower-right corner is defined by the image dimensions
For a landscape images 5000x2000 this is (5000,2000)


0-------------------------5000
.      x
.
.          x
.
.                     x
.     x
2000


---


view_id.sift.feat Matrix (without column title):

x		y		scale	orientation
2711.52	1571.74	308.335	4.75616

-
(to plot this, make y negative (*-1))

---
scale: square/circle size
orientation: line from origin rotated in radiant

------
## ImageMatching

Matches all images (tree)

197018718 907017304 1638077662
907017304 1638077662

_which images are matched against each other. Example_

W X Y Z
X Y Z
Y Z

W will be matched with X, Y, Z, then X with Y and Z and so on

-------

## FeatureMatching

0.matches.txt

Matches all features of the images from ImageMatching Pairs

#viewid1		#viewid2
197018718 907017304
1 #first maching pair
sift 2632 # detected matches
44 38
183 122
907017304 1638077662 #viewid2 #viewid3
1
sift 2707
90 74
110 134
197018718 1638077662 #viewid1 #viewid3
1
sift 1929
129 74

** StructureFromMotion

calculates poses

"poses": [
        {
            "poseId": "797046670",
            "pose": {
                "transform": {
                    "rotation": [
                        "0.99328929576636837",
                        "-0.10823948227899582",
                        "0.040750329960956289",
                        "0.11564708144936042",
                        "0.92507429929971252",
                        "-0.36175031904255572",
                        "0.0014585843125640811",
                        "0.36403537637312233",
                        "0.93138397950613383"
                    ],
                    "center": [
                        "-0.16712305009175787",
                        "1.6837678457953795",
                        "0.56603363841980026"
                    ]
                },
                "locked": "1"
            }
        },
     ]
}

camera rotation as a quaternion
https://github.com/alicevision/meshroom/blob/bc1eb83d92048e6f888c4762c7ffcaab50395da6/meshroom/ui/reconstruction.py#L293
https://math.stackexchange.com/questions/893984/conversion-of-rotation-matrix-to-quaternion
https://www.euclideanspace.com/maths/geometry/rotations/conversions/matrixToQuaternion/

_cameraInit.sfm is being augmented by the sfm node and saved as cameras.sfm:_

version: [Array]
featuresFolder: ["node-internal-folder-path"]
matchesFolder: ["node-internal-folder-path"]
views: [Array] _the different views (images) with unique ids and metadata_
intrinsics: [Array] _the camera intrinsics, views (images) with the same intrinsics share an id. A different zoom level or camera has a new id. Includes principalPoint and distortionParams_
poses: [Array] _the camera poses_


-----
**mesh filtering node

uses
https://github.com/bldeng/MeshSDFilter

------------------------------

------------------------------

---
Masking idea for generic background

https://docs.opencv.org/2.4/modules/features2d/doc/common_interfaces_of_descriptor_extractors.html

Descriptors are generated from features,
so it should be possible to filter descriptors before using masks

We do not want to manually generate all the masks,
so we could use the results from FeatureMatching.
When we select features in one image to be masked,
the matching features in the other images can be masked as well.
The corresponding descriptors need to be updated,
than we can do the SfM only for the relevant area.

Double click on FeatureMatching to load the Feature Masking GUI
Select features and highlight matching features in the other images

Button for brushes to select include/exclude areas

new node for re-computation of decribers
re-computation node could have a button or icon to mark it as "user interaction required"
	How the nodes work (reconstructed, personal project)
	CC BY-SA

	## CameraInit

	reads image metadata
	sensor width lookup

	Calculates viewpoints.sfm with image metadata and Intrinsics

	Intrinsics Kmatrix "f;0;ppx;0;f;ppy;0;0;1"

	Create cameraInit.sfm

	calculates distortionParams




	Although cameraInit.sfm uses data from viewpoints.sfm, those parts are slightly modified



	the sfm files are json files

	FoV https://www.scantips.com/lights/fieldofviewmath.html
	https://www.pointsinfocus.com/tools/depth-of-field-and-equivalent-lens-calculator/#{%22c%22:[{%22f%22:13,%22av%22:%228%22,%22fl%22:50,%22d%22:3048,%22cm%22:%220%22}],%22m%22:0}


	CameraInit.sfm/viewpoints.sfm file structure:
	(viewpoints.sfm is the file generated when importing images to meshroom. CameraInit.sfm is the output that is being used in the next node)
	----
	version: [Array]
	views: [Array] _the different views (images) with unique ids and metadata_
	intrinsics: [Array] _the camera intrinsics, views (images) with the same intrinsics share an id. A different zoom level or camera has a new id. Includes principalPoint and distortionParams_
	----





	## Feature Extraction (SIFT)

	desc are binary files
	feat are ascii ones

	view_id.sift.feat -> table with the extracted features
	view_id.sift.desc -> descriptors

	Reference: http://www.vlfeat.org/overview/sift.html
	https://dsp.stackexchange.com/questions/24346/difference-between-feature-detector-and-descriptor
	---
	The image origin (top-left corner) has coordinate (0,0)
	The lower-right corner is defined by the image dimensions
	For a landscape images 5000x2000 this is (5000,2000)


	0-------------------------5000
	. x
	.
	. x
	.
	. x
	. x
	2000


	---


	view_id.sift.feat Matrix (without column title):

	x y scale orientation
	2711.52 1571.74 308.335 4.75616

	-
	(to plot this, make y negative (*-1))

	---
	scale: square/circle size
	orientation: line from origin rotated in radiant

	------
	## ImageMatching

	Matches all images (tree)

	197018718 907017304 1638077662
	907017304 1638077662

	_which images are matched against each other. Example_

	W X Y Z
	X Y Z
	Y Z

	W will be matched with X, Y, Z, then X with Y and Z and so on

	-------

	## FeatureMatching

	0.matches.txt

	Matches all features of the images from ImageMatching Pairs

	#viewid1 #viewid2
	197018718 907017304
	1 #first maching pair
	sift 2632 # detected matches
	44 38
	183 122
	907017304 1638077662 #viewid2 #viewid3
	1
	sift 2707
	90 74
	110 134
	197018718 1638077662 #viewid1 #viewid3
	1
	sift 1929
	129 74

	** StructureFromMotion

	calculates poses

	"poses": [
	{
	"poseId": "797046670",
	"pose": {
	"transform": {
	"rotation": [
	"0.99328929576636837",
	"-0.10823948227899582",
	"0.040750329960956289",
	"0.11564708144936042",
	"0.92507429929971252",
	"-0.36175031904255572",
	"0.0014585843125640811",
	"0.36403537637312233",
	"0.93138397950613383"
	],
	"center": [
	"-0.16712305009175787",
	"1.6837678457953795",
	"0.56603363841980026"
	]
	},
	"locked": "1"
	}
	},
	]
	}

	camera rotation as a quaternion
	https://github.com/alicevision/meshroom/blob/bc1eb83d92048e6f888c4762c7ffcaab50395da6/meshroom/ui/reconstruction.py#L293
	https://math.stackexchange.com/questions/893984/conversion-of-rotation-matrix-to-quaternion
	https://www.euclideanspace.com/maths/geometry/rotations/conversions/matrixToQuaternion/

	_cameraInit.sfm is being augmented by the sfm node and saved as cameras.sfm:_

	version: [Array]
	featuresFolder: ["node-internal-folder-path"]
	matchesFolder: ["node-internal-folder-path"]
	views: [Array] _the different views (images) with unique ids and metadata_
	intrinsics: [Array] _the camera intrinsics, views (images) with the same intrinsics share an id. A different zoom level or camera has a new id. Includes principalPoint and distortionParams_
	poses: [Array] _the camera poses_


	-----
	**mesh filtering node

	uses
	https://github.com/bldeng/MeshSDFilter

	------------------------------

	------------------------------

	---
	Masking idea for generic background

	https://docs.opencv.org/2.4/modules/features2d/doc/common_interfaces_of_descriptor_extractors.html

	Descriptors are generated from features,
	so it should be possible to filter descriptors before using masks

	We do not want to manually generate all the masks,
	so we could use the results from FeatureMatching.
	When we select features in one image to be masked,
	the matching features in the other images can be masked as well.
	The corresponding descriptors need to be updated,
	than we can do the SfM only for the relevant area.

	Double click on FeatureMatching to load the Feature Masking GUI
	Select features and highlight matching features in the other images

	Button for brushes to select include/exclude areas

	new node for re-computation of decribers
	re-computation node could have a button or icon to mark it as "user interaction required"