gonzalo-bulnes/simian-notes.md

## simian-notes.md

      
### Notes

A few notes taken while reading the github.com/mandykoh/simian code.

#### At a glance...

An index is a tree of nodes. Each node contains one or more entries. An entry is (roughly) a thumbnail of a given size, and the corresponding fingerprint. Within a node, entries are accessible by fingerprint.
Fingerprints provide a concept of "distance" between entries, and I guess (haven't read that yet) that the concept is used to make sure that all entries within a node are "closer" to each other than to any of the entries of any other node.
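To make the picture above concrete, here is a minimal sketch of how I imagine the structure. Every name in it (`fingerprint`, `entry`, `node`, `distance`) is hypothetical and not taken from the simian code, and the distance metric (sum of absolute differences) is an assumption, since I haven't read how simian actually compares fingerprints.

```go
package main

import "fmt"

// fingerprint is a hypothetical stand-in: a flat list of brightness samples.
type fingerprint []uint8

// distance is an assumed metric (sum of absolute sample differences).
// The notes do not say which metric simian really uses.
func distance(a, b fingerprint) int {
	n := len(a)
	if len(b) < n {
		n = len(b)
	}
	d := 0
	for i := 0; i < n; i++ {
		diff := int(a[i]) - int(b[i])
		if diff < 0 {
			diff = -diff
		}
		d += diff
	}
	return d
}

// entry pairs a key with a fingerprint (the thumbnail is omitted here).
type entry struct {
	key string
	fp  fingerprint
}

// node holds entries accessible by fingerprint, plus child nodes.
type node struct {
	entries  map[string][]entry // keyed by the fingerprint bytes
	children []*node
}

// add stores an entry under its fingerprint.
func (n *node) add(e entry) {
	if n.entries == nil {
		n.entries = make(map[string][]entry)
	}
	k := string(e.fp)
	n.entries[k] = append(n.entries[k], e)
}

// find returns the entries stored under exactly this fingerprint.
func (n *node) find(fp fingerprint) []entry {
	return n.entries[string(fp)]
}

func main() {
	root := &node{}
	root.add(entry{key: "a", fp: fingerprint{0, 64, 128}})
	root.add(entry{key: "b", fp: fingerprint{64, 64, 192}})

	fmt.Println(len(root.find(fingerprint{0, 64, 128})))
	fmt.Println(distance(fingerprint{0, 64, 128}, fingerprint{64, 64, 192}))
}
```

Keying the map by the fingerprint bytes is only one way to get "accessible by fingerprint"; how entries are distributed across child nodes is left out because I haven't read that part yet.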
#### IndexEntry

- created from a JSON file located at some path, e.g. `some/path/example.json` (the extension doesn't matter, but the file must be valid JSON)
- alongside that file, there is also a thumbnail (`.thumb`) named accordingly, e.g. `some/path/example.thumb`
- no fingerprint and no attributes are set when creating the entry from a file
- can also be created from an image, at any arbitrary size
- in that case, the key is a random 64-character string (the hex representation of 32 random bytes)
- the thumbnail is generated at the given size (twice the `maxFingerprintSize` of the entry, whatever that is)
- has a fingerprint (why is it called `MaxFingerprint`?)
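The key format and the file naming described above can be sketched in a few lines of Go. This is an illustration under assumptions, not the simian implementation: `newKey` and `thumbnailPath` are names I made up, and I'm assuming the random bytes come from a cryptographic source.

```go
package main

import (
	"crypto/rand"
	"encoding/hex"
	"fmt"
	"path/filepath"
	"strings"
)

// newKey returns the hex representation of 32 random bytes,
// which is always 64 characters long. (Hypothetical helper.)
func newKey() (string, error) {
	b := make([]byte, 32)
	if _, err := rand.Read(b); err != nil {
		return "", err
	}
	return hex.EncodeToString(b), nil
}

// thumbnailPath swaps whatever extension the entry file has for ".thumb".
// (Hypothetical helper.)
func thumbnailPath(entryPath string) string {
	return strings.TrimSuffix(entryPath, filepath.Ext(entryPath)) + ".thumb"
}

func main() {
	key, err := newKey()
	if err != nil {
		panic(err)
	}
	fmt.Println(len(key))                                // 64
	fmt.Println(thumbnailPath("some/path/example.json")) // some/path/example.thumb
}
```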


#### Thumbnail

- is an `image.Image`
- has a size (the smallest of its dimensions)

#### Fingerprint

- has a size (half that of the corresponding thumbnail)
- is a (specific, arbitrary) function of a thumbnail of the given size, i.e. half the size of the entry's thumbnail
- the function clears the six (magic number) least significant bits of the grayscale component (Y in YCbCr) of each pixel of the image. I think that means ignoring the most detailed information about each pixel: assuming 8-bit values, only the top two bits survive, so each pixel ends up at one of four brightness levels (0, 64, 128 or 192). That coarsens the tones of the image more than it "blurs" it in the spatial sense.