Skip to content

Instantly share code, notes, and snippets.

@zac-williamson
Last active September 3, 2019 13:03
Show Gist options
  • Star 0 You must be signed in to star a gist
  • Fork 0 You must be signed in to fork a gist
  • Save zac-williamson/bc0774e2bd4cad6ffd5e2edd2166a30c to your computer and use it in GitHub Desktop.
Save zac-williamson/bc0774e2bd4cad6ffd5e2edd2166a30c to your computer and use it in GitHub Desktop.

Transcript files

Our setup produces 100,000,000 G1 points and 1 G2 point, over the BN128 curve.

This is split into multiple 'transcript' files. Each transcript file contains 5,000,000 G1 points. The first transcript file also contains 1 G2 point.

Our current plan is to not compress the points. The rationale was that it would take longer for the client to decompress the points, than it would to just upload/download the extra data

Data format

The transcript file contains raw binary data, where data elements are located by knowing their precise byte-position in the file. We write our data as follows:

For big-integer numbers (g1, g2 coordinates), we describe each 256-bit field element as a uint64_t[4] array. The first entry is the least significant word of the field element. Each 'word' is written in big-endian form.

For other integers (in the manifest section), variables are directly written in big-endian form.

Example Python decoder from Vitalik

def decode(chunk): return sum([int.from_bytes(chunk[i:i+8], 'big') * 256**i for i in range(0, 32, 8)])

Structure of a transcript file

The transcript file contains 4 linear data sections

section number section size (bytes) description
1 28 A 'manifest' containing metadata
2 320,000,000 5,000,000 uncompressed G1 points
3 256 or 0 (first transcript only) Two uncompressed G2 points
4 64 The 'checksum' - a BLAKE2B hash of the rest of the file's data

The G2 points

The first G2 point is z.[2], where z is the toxic waste from the previous participant. Is used, in combination with the previous participant's transcript, to check that the current transcript was built off of the previous participant's transcript The second G2 point is x.[2] where x is the trusted setup toxic waste.

Naming scheme

The transcript input files are called 'transcript0.dat', 'transcript1.dat', ..., 'transcript19.dat'

The transcript output files are called 'transcript0_out.dat', ..., 'transcript19_out.dat'

The manifest structure:

The manifest is 28 bytes of data with the following structure

byte positions description
0-4 transcript number (starting from 0)
4-8 total number of transcripts (should be 20)
8-12 total number of G1 points in all transcripts (should be 100,000,000)
12-16 total number of G2 points in all transcripts (should be 1)
16-20 number of G1 points in this transcript (should be 5,000,000)
20-24 number of G2 points in this transcript (2 for 1st transcript, 0 for the rest)
24-28 'start-from', the index of the 1st G1 point in this transcript

Regarding start-from: the value will be 0 in transcript0.dat, 5,000,000 in transcript1.dat, 95,000,000 in transcript19.dat etc

We have a bit of a continuity error, where for the first transcript, the 'local' number of G2 points is 2, when the 'total' number is 1. In the former, we're including the G2 element created by the participant, to verify transcripts. In the latter, we're referring to the total number of G2 elements in the structured reference string we're producing.

G1 point structure

The first G1 point will be x.[1], where x is the trusted setup toxic waste, and [1] is the bn128 G1 generator point (1, 2)

Structure is as follows: x.[1], x^{2}.[1], ..., x^{100,000,000}[1]

Each participant generates their own randomness z and exponentiates each point by z^{i}, where i is the G1 point index

G2 point structure

The only G2 point is x.[2], where [2] is the bn128 G2 generator point with coordinates:

{
    "x": {
        "c0": "10857046999023057135944570762232829481370756359578518086990519993285655852781",
        "c1": "11559732032986387107991004021392285783925812861821192530917403151452391805634"
    },
    "y": {
        "c0": "8495653923123431417604973247489272438418190587263600148770280649306958101930",
        "c1": "4082367875863433681332203403145435568316851327593401208105741076214120093531"
    }
}
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment