Minglejingle: Scalable blockchain with non-interactive transactions

This article describes the Minglejingle (MJ) protocol: a redesign of Mimblewimble (MW) for non-interactive transactions. It preserves the security and privacy properties of MW and supports payment proofs, non-interactive coinjoin and cut-through of spent outputs.
1. Introduction

Blockchains are distributed ledgers that preserve the transaction history so that new network participants can, at any point in the future, verify these two security properties:

Supply security: No counterfeit coins have been created.
Ownership security: No coins have been moved without authorization by the owner of the associated private keys.

Bitcoin achieves unconditional supply security as a result of transparent transaction amounts. Ownership security is achieved by storing all historical transactions forever, which allows the signatures authorizing the transfer of coins to be verified at any time.
The Mimblewimble protocol, proposed in 2016 by an anonymous researcher [1], makes blockchains much more efficient while still providing both security properties. MW replaces output values with homomorphic Pedersen commitments, which allows both supply security and ownership security to be reduced to a single signature verification per transaction while removing the need to keep all historical outputs in the blockchain.
However, the efficiency and elegance of MW come at a cost. All transactions must be constructed interactively by the sender and all the receivers. This has both usability and security implications:

The sender and the receivers must typically be online at the same time and be able to communicate. Since most users connected to the Internet don't have a public IP address, the interaction is usually done via Tor or using a third party service.
The private keys are needed in order to receive funds. This means that the recipient must typically keep their private keys in a hot wallet connected to the internet. This poses an additional security risk.

There have been many proposals to alleviate these issues and enable non-interactive (one-sided) transactions in the MW protocol. However, these proposals either don't support payment proofs [2] or don't provide ownership security [3, 4].
1.1 Contribution

This article presents Minglejingle, a redesign of the MW protocol with the following features:

Transactions are fully non-interactive. The sender only needs to know the recipient's address, which consists of 2 public keys.
Addresses are not linkable to wallets.
Transaction outputs are not linkable to addresses.
Blocks don't link transaction inputs to outputs.
The protocol provides both supply security and ownership security.
Spent outputs are not needed for chain verification.
The protocol provides unconditional payment proofs.

As MJ represents a compromise between efficiency and usability, there are some drawbacks:

The protocol requires more complex consensus rules than MW.
The amount of data that new nodes need to download in order to verify the full transaction history is 2-3 times larger than in MW, depending on the fraction of spent outputs. However, this is still several times less than other non-interactive protocols with similar privacy properties (details in §6.1).

2. Cryptographic primitives

2.1 Notation

We assume that 𝔾 is a cyclic group of prime order q. Uppercase letters usually refer to group elements (public keys, commitments) and lowercase letters usually refer to numbers in Z(q) (scalars, private keys). We will use the additive notation for group operations.
Let G be the generator of 𝔾 and H be another element of 𝔾 with unknown discrete logarithm relationship to G.
2.2 Hash functions

We assume the existence of three hash functions:

H_d is a hash function {0,1}* -> {0,1}^(2λ) ("hash-to-digest"), where λ is the security level in bits
H_s is a hash function {0,1}* -> Z(q) ("hash-to-scalar")
H_p is a hash function {0,1}* -> 𝔾 ("hash-to-point")

These hash functions are modeled as random oracles with outputs distributed uniformly over their respective ranges.
2.2.1 Tagging

Tags are used to prevent the outputs of hash functions from being misused in a different context than intended. Tags will be denoted by a capital letter T with a lower index specifying the name of the tag. Tags are passed to hash functions alongside other input fields. For example, T_send is a tag specifying that the input is used in the context of sending funds.
2.3 Pedersen commitments

A Pedersen commitment C is an element of 𝔾 constructed as:
C = r * G + v * H

where r is the blinding factor and v is the value of the commitment. Pedersen commitments are homomorphic, which means that adding two commitments C₁ and C₂ to values v₁ and v₂ produces a commitment to (v₁ + v₂) mod q.
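The homomorphic property can be illustrated with a small sketch. It does not use an elliptic curve: as an assumption made purely for illustration, 𝔾 is replaced by the order-q subgroup of the integers modulo a tiny prime p, so the article's group addition becomes multiplication modulo p. The parameters and the helper names `add`, `mul` and `commit` are hypothetical and offer no security.

```python
# Toy group for illustration only: the order-q subgroup of Z_p^* with p = 2q + 1.
# The article's additive notation maps to multiplication modulo p here.
p, q = 2039, 1019
G, H = 4, 9  # both have order q; in a real system log_G(H) must be unknown

def add(P, Q):
    """Group operation, written P + Q in the article."""
    return (P * Q) % p

def mul(k, P):
    """Scalar multiplication k*P."""
    return pow(P, k % q, p)

def commit(r, v):
    """Pedersen commitment C = r*G + v*H."""
    return add(mul(r, G), mul(v, H))

C1 = commit(r=123, v=40)
C2 = commit(r=456, v=2)

# The sum commits to v1 + v2 under the blinding factor r1 + r2 (both modulo q).
assert add(C1, C2) == commit(123 + 456, 40 + 2)

# Values wrap around modulo q, which is why range proofs are needed.
assert commit(5, q + 1) == commit(5, 1)
```

The last assertion shows the overflow that the next section addresses: without a bound on v, a commitment to q + 1 is indistinguishable from a commitment to 1.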
2.4 Range proofs

When adding Pedersen commitments, their values are added modulo q. To prevent overflow, we need to restrict the possible values of v to a range much smaller than q. A range proof is a special kind of zero-knowledge proof that shows a commitment C is of the form r*G + v*H where 0 <= v < 2ⁿ and n is the required precision in bits, without revealing the values r or v. For monetary operations, 64-bit precision is more than enough to represent the possible range of values.
This article assumes the use of Bulletproofs+, a short and efficient range proof that requires 15 group elements and 3 scalars to prove that v is a 64-bit value [5]. The proofs for several different commitments can be aggregated with just 2 additional group elements per commitment.
2.5 Diffie-Hellman key exchange

If Alice owns a key pair (x_a, P_a) and Bob owns a key pair (x_b, P_b), then x_a*P_b = x_b*P_a is a secret that only Alice and Bob can compute after sharing their public keys P_a and P_b. This is called the Diffie-Hellman (DH) key exchange protocol [6].
2.6 Schnorr signatures

Schnorr signatures are a digital signature scheme [7] that works as follows:
2.6.1 Signing

To sign a message m using the private key x (corresponding to the public key P), it is first necessary to select a random nonce value r and to calculate R = r*G. The signature then consists of the pair (R,s), where s = r + x*H_s(R, P, m).
2.6.2 Verification

A signature (R,s) of the message m can be verified with the public key P by checking s*G ?= R + H_s(R, P, m)*P
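The signing and verification equations above can be sketched as follows. This is a minimal illustration under stated assumptions: 𝔾 is replaced by a tiny prime-order subgroup of the integers modulo p (so group addition becomes multiplication modulo p), and H_s is modeled with SHA-256 reduced modulo q. The function names are hypothetical and the parameters are insecure.

```python
import hashlib
import secrets

# Toy group: order-q subgroup of Z_p^*, p = 2q + 1 (illustration only).
p, q = 2039, 1019
G = 4

def add(P, Q):
    return (P * Q) % p

def mul(k, P):
    return pow(P, k % q, p)

def H_s(*fields):
    """Hash-to-scalar stand-in: SHA-256 of the fields, reduced modulo q."""
    digest = hashlib.sha256(repr(fields).encode()).digest()
    return int.from_bytes(digest, "big") % q

def sign(x, m):
    """Return (R, s) with R = r*G and s = r + x*H_s(R, P, m)."""
    P = mul(x, G)
    r = secrets.randbelow(q - 1) + 1  # random nonce in [1, q-1]
    R = mul(r, G)
    s = (r + x * H_s(R, P, m)) % q
    return R, s

def verify(P, m, sig):
    """Check s*G ?= R + H_s(R, P, m)*P."""
    R, s = sig
    return mul(s, G) == add(R, mul(H_s(R, P, m), P))

x = 77
P = mul(x, G)
signature = sign(x, "hello")
assert verify(P, "hello", signature)
```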
2.6.3 Interactive multi-signatures

Schnorr signatures of the same message with two or more private keys can be aggregated into one multi-signature (R,s) if the signers cooperate. This interactive protocol is described in §3.2.2 as part of the MW protocol.
2.6.4 Non-interactive half-aggregation

Two Schnorr signatures (R₁, s₁), (R₂, s₂) of two different messages m₁, m₂ with two different public keys P₁, P₂ can be partially aggregated into one signature (R₁, R₂, s) with s = s₁ + y*s₂, where y = H_s(T_agg, R₁, R₂, P₁, P₂, m₁, m₂) (or an equivalent random oracle output). The scheme can be easily extended for any number of signatures.
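A sketch of half-aggregation for two signatures is given below. The verification equation s*G ?= (R₁ + e₁*P₁) + y*(R₂ + e₂*P₂), with e_j = H_s(R_j, P_j, m_j), is not spelled out in the text; it is derived here from the two individual verification equations of §2.6.2. As before, the toy group, the SHA-256 stand-in for H_s and all function names are assumptions for illustration only.

```python
import hashlib
import secrets

p, q = 2039, 1019  # toy group: order-q subgroup of Z_p^* (insecure)
G = 4

def add(P, Q):
    return (P * Q) % p

def mul(k, P):
    return pow(P, k % q, p)

def H_s(*fields):
    digest = hashlib.sha256(repr(fields).encode()).digest()
    return int.from_bytes(digest, "big") % q

def sign(x, m):
    P = mul(x, G)
    r = secrets.randbelow(q - 1) + 1
    R = mul(r, G)
    return R, (r + x * H_s(R, P, m)) % q

def half_aggregate(sig1, sig2, P1, P2, m1, m2):
    """Combine (R1, s1) and (R2, s2) into (R1, R2, s) with s = s1 + y*s2."""
    (R1, s1), (R2, s2) = sig1, sig2
    y = H_s("T_agg", R1, R2, P1, P2, m1, m2)
    return R1, R2, (s1 + y * s2) % q

def verify_half_aggregate(agg, P1, P2, m1, m2):
    """Check s*G ?= (R1 + e1*P1) + y*(R2 + e2*P2)."""
    R1, R2, s = agg
    y = H_s("T_agg", R1, R2, P1, P2, m1, m2)
    term1 = add(R1, mul(H_s(R1, P1, m1), P1))
    term2 = add(R2, mul(H_s(R2, P2, m2), P2))
    return mul(s, G) == add(term1, mul(y, term2))

x1, x2 = 5, 7
P1, P2 = mul(x1, G), mul(x2, G)
agg = half_aggregate(sign(x1, "m1"), sign(x2, "m2"), P1, P2, "m1", "m2")
assert verify_half_aggregate(agg, P1, P2, "m1", "m2")
```

The aggregate stores two nonces but only one scalar, which is where the saving of one scalar per additional signature comes from.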
2.7 Elliptic curve choice

For the 128-bit security level, a good choice to represent the group 𝔾 is the ed25519 elliptic curve. It is a twisted Edwards curve that has complete point addition formulas, allowing fast and secure constant-time implementations. Although this curve has a cofactor of 8, this can be eliminated by using the Ristretto abstraction for group elements, which produces a prime-order group and protects from small-subgroup attacks [8].
2.7.1 Sizes

The table below lists the serialized sizes of the cryptographic primitives when using the ed25519 elliptic curve.


| primitive | elements | size (bytes) |
|---|---|---|
| private key, scalar | `Z(q)` | 32 |
| public key, commitment | `𝔾` | 32 |
| range proof (BP+) | 15x `𝔾`, 3x `Z(q)` | 576 |
| 2x aggregated BP+ | 17x `𝔾`, 3x `Z(q)` | 640 |
| Schnorr signature | 1x `𝔾`, 1x `Z(q)` | 64 |
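
The sizes in the table follow directly from the element counts, since both group elements and scalars serialize to 32 bytes. A short check of that arithmetic (the names below are illustrative only):

```python
GROUP_ELEMENT_BYTES = 32
SCALAR_BYTES = 32

def serialized_size(group_elements, scalars):
    """Size in bytes of a primitive made of the given number of elements."""
    return group_elements * GROUP_ELEMENT_BYTES + scalars * SCALAR_BYTES

primitive_sizes = {
    "range proof (BP+)": serialized_size(15, 3),
    "2x aggregated BP+": serialized_size(17, 3),
    "Schnorr signature": serialized_size(1, 1),
}

for name, size in primitive_sizes.items():
    print(f"{name}: {size} bytes")
```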


3. Baseline protocols

3.1 Bitcoin with CT (BCT)

For comparison, we show what a simplified Bitcoin protocol might look like with Confidential Transactions (CT), i.e. output amounts hidden with Pedersen commitments.
3.1.1 Transaction inputs

In order to spend an output, the sender must add the following data to the blockchain:

The ID of the transaction and the index of the output that's being spent
A Schnorr signature (R, s) showing the possession of the private key for the output
The difference d between the blinding factors of the outputs and the blinding factors of the inputs.

If multiple inputs are spent at once, the signatures become multi-signatures (§2.6.3), so two inputs require 168 bytes of data.
3.1.2 Transaction outputs

The following data needs to be stored in the BCT blockchain for each output:

The output commitment C = r*G + v*H
A range proof showing that 0 <= v < 2⁶⁴
The receiver's public key K_r
The sender's ephemeral public key K_e for DH key exchange
The value v encrypted with the shared key between the sender and the receiver. The blinding factor r can be derived deterministically from the shared secret.

If multiple outputs are created at once, the range proofs can be aggregated.
3.1.3 Verification

Each BCT transaction can be verified in 2 steps:

Checking that the difference between the output commitments and the input commitments is equal to d*G. This proves that the transaction values balance out.
Checking that (R, s) is a valid Schnorr multi-signature with the input public keys.

3.1.4 Privacy

The only privacy feature provided by BCT is that the transaction amounts are hidden.
3.1.5 Payment proofs

Since transactions are stored on the blockchain forever, payment proofs are easy. The sender only needs to point to a transaction output and provide the private key for K_e to reveal the value of the output.
3.2 Mimblewimble

Our main baseline is the interactive MW protocol.
3.2.1 Transactions

Each MW transaction consists of:

One or more inputs, which are just Pedersen commitments and refer to unspent outputs of previous transactions.
One or more outputs, each consisting of a Pedersen commitment and a range proof.
A transaction kernel, which consists of a public key K and an associated Schnorr signature (R, s).
An offset o, which can be aggregated.

3.2.2 Interactive protocol

If Carol, who owns an unspent output C_i = k_i*G + v_i*H, wants to send v_o coins to Dave, the following interactive protocol must be executed:

1. Carol calculates the change output value v_c = v_i - v_o, constructs the change output C_c = k_c*G + v_c*H, selects a nonce r_c and offset o_c, then sends C_i, C_c, v_o, o_c and R_c = r_c*G to Dave.
2. Dave constructs his output C_o = k_o*G + v_o*H, constructs a range proof for C_o, selects a nonce r_d and offset o_d, calculates the kernel public key K = C_o + C_c - C_i - (o_c + o_d)*G, calculates the aggregated nonce R = R_c + r_d*G and constructs a partial signature s_d = r_d + (k_o-o_d)*H_s(R,K), then sends C_o, the range proof, R_d = r_d*G, o_d and s_d back to Carol.
3. Carol completes the signature as s = s_d + r_c + (k_c-k_i-o_c)*H_s(R,K) and can finally construct the transaction consisting of the input C_i, the outputs C_o and C_c with the associated range proofs, the kernel K, kernel signature (R,s) and the offset o = o_c + o_d.

3.2.3 Verification

The resulting transaction can be easily validated by checking:

1. C_o + C_c - C_i ?= K + o*G
2. s*G ?= R + K*H_s(R,K)

The signature verification in step 2 serves both as a balance proof (K doesn't contain any coins) and the proof of ownership.
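The three protocol steps of §3.2.2 and the two checks above can be traced in a short sketch. It is an illustration under assumptions, not an implementation of MW: the group is a tiny prime-order subgroup of the integers modulo p, H_s is modeled with SHA-256, range proofs are omitted, and all secret scalars are fixed constants so that the run is reproducible (a real wallet would draw them at random).

```python
import hashlib

p, q = 2039, 1019  # toy group: order-q subgroup of Z_p^* (insecure)
G, H = 4, 9

def add(P, Q):
    return (P * Q) % p

def neg(P):
    return pow(P, q - 1, p)  # inverse of P in a group of order q

def mul(k, P):
    return pow(P, k % q, p)

def commit(r, v):
    return add(mul(r, G), mul(v, H))

def H_s(*fields):
    digest = hashlib.sha256(repr(fields).encode()).digest()
    return int.from_bytes(digest, "big") % q

# Carol's unspent output and the amount she wants to send.
v_i, v_o = 50, 30
k_i = 111
C_i = commit(k_i, v_i)

# Step 1 (Carol): change output, nonce and offset.
v_c = v_i - v_o
k_c, r_c, o_c = 222, 333, 444
C_c = commit(k_c, v_c)
R_c = mul(r_c, G)

# Step 2 (Dave): his output, the kernel key and a partial signature.
k_o, r_d, o_d = 555, 666, 777
C_o = commit(k_o, v_o)
K = add(add(C_o, C_c), neg(add(C_i, mul(o_c + o_d, G))))
R = add(R_c, mul(r_d, G))
e = H_s(R, K)
s_d = (r_d + (k_o - o_d) * e) % q

# Step 3 (Carol): complete the signature and aggregate the offsets.
s = (s_d + r_c + (k_c - k_i - o_c) * e) % q
o = (o_c + o_d) % q

# Verification: balance equation and kernel signature.
assert add(add(C_o, C_c), neg(C_i)) == add(K, mul(o, G))
assert mul(s, G) == add(R, mul(e, K))
```

Because the values balance, K has no H component, so K = (k_o + k_c - k_i - o)*G and the combined signature verifies under K.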
3.2.4 Cut-through

The MW protocol allows for the removal of spent outputs and the inputs of all transactions while preserving the security properties. Each historical transaction only leaves behind the kernel, which is about 96 bytes at the 128-bit security level.
3.2.5 Efficiency analysis

The main points that make MW so efficient are the following:

The use of the blinding factor to both obscure the values of the outputs and to authorize the spending.
Unspent outputs consisting of the absolute minimum amount of data (only a commitment + range proof).
The removal of spent outputs.
The kernel signature being an aggregated signature over all of the involved private keys.

3.2.6 Privacy

MW is more private than BCT. Since transaction inputs and outputs are only linked indirectly via the kernel public key and the kernel signature doesn't sign any output data, MW provides "accidental" privacy by enabling non-interactive coinjoin.
3.2.7 Payment proof

The MW blockchain doesn't include any provisions for payment proofs. If Carol requires a payment proof, it must be provided by Dave in step 2 of the interactive protocol. The validity of the payment proof must be bound to the existence of the transaction kernel on the blockchain to resolve disputes in case Carol doesn't complete step 3 or Dave spends the output.
4. Towards a non-interactive protocol

4.1 Goals

The main design goals are the following:

Fully non-interactive transactions.
Both supply security and ownership security.
Unconditional payment proofs.
Privacy equivalent to the interactive MW protocol.
Smallest possible blockchain.
Fastest possible verification.

4.2 Simulating the receiver

For Carol to be able to construct a MW-like transaction non-interactively, she must be able to derive all values from step 2 of §3.2.2 that are provided by Dave in the interactive protocol. This can be achieved using the DH key exchange. The shared DH key can be used for the following:

Generating the blinding factor r_d for Dave's output commitment. The blinding factor must be known to Carol to enable her to construct the required range proofs.
Encrypting the value v_o and other data associated with the payment.


Observation 1


Because the blinding factor is known by both Carol and Dave, it is no longer sufficient as a proof of ownership of the output.


In order to restrict the ability to spend the output to Dave, the transaction output needs to include an output key that only Dave knows the private key for.
4.3 Addresses

To enable non-interactive transactions, we need to introduce addresses. For privacy reasons, Dave's output received from Carol should be bound to a unique one-time public key that is cryptographically unlinkable to Dave's public address.
Additionally, we want to enable Dave to generate a virtually unlimited number of different unlinkable addresses so that no one can trace his online activities.
We can achieve both of these properties with a variant of the subaddress scheme used by Monero. [9]
4.3.1 Address generation

Dave can generate a pair of private keys a and b (in practice, both keys can be derived from the same secret seed). a is called the private view key and can be used to recognize payments. b is the private spend key and is needed to spend outputs. Separating the viewing and spending ability allows the private view key to be revealed to an auditor without risking a loss of funds.
Whenever Dave needs an address, he can select an integer index i and generate the associated public address (A_i, B_i) as:

m_i = H_s(T_addr, a, i)
A_i = a*(m_i + b)*G
B_i = (m_i + b)*G

Notice that A_i = a*B_i for any index i. This allows Dave to calculate a DH shared secret using the same private key a for all his addresses.
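The address derivation can be sketched as follows, again over a toy group with SHA-256 standing in for H_s (both are assumptions for illustration, and the function name `address` is hypothetical). The loop checks the relation A_i = a*B_i for several indices.

```python
import hashlib

p, q = 2039, 1019  # toy group: order-q subgroup of Z_p^* (insecure)
G = 4

def mul(k, P):
    return pow(P, k % q, p)

def H_s(*fields):
    digest = hashlib.sha256(repr(fields).encode()).digest()
    return int.from_bytes(digest, "big") % q

def address(a, b, i):
    """Derive the public address (A_i, B_i) for index i."""
    m_i = H_s("T_addr", a, i)
    A_i = mul(a * (m_i + b), G)
    B_i = mul(m_i + b, G)
    return A_i, B_i

a, b = 321, 654  # private view key and private spend key (fixed for the demo)
for i in range(5):
    A_i, B_i = address(a, b, i)
    assert A_i == mul(a, B_i)  # the same view key a works for every address
```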
4.3.2 Output construction

Let's suppose that Carol and Dave have agreed in advance on an amount v to be sent and Dave has provided his public address (A, B) constructed according to §4.3.1 for some index i unknown to Carol.

1. Carol can begin constructing the output for Dave by selecting a 128-bit nonce n uniformly at random. We will later explain how this nonce can be used for payment proofs.
2. Carol then calculates a unique sending key s = H_s(T_send, A, B, v, n).
3. Carol derives a shared secret t = H_d(T_derive, s*A).
4. Carol constructs a one-time public key for Dave: K_o = H_s(T_outkey, t)*G + B. Only Dave knows the corresponding private key.
5. Carol feeds the shared secret t into a stream cipher to derive a blinding factor r and two encryption masks m_v and m_n.

Now Carol has all the pieces needed to form an MJ output:


| field | description | size (bytes) |
|---|---|---|
| `C_o=r*G+v*H` | amount commitment | 32 |
| | range proof for `C_o` | 576 |
| `K_o` | one-time output key | 32 |
| `K_e=s*B` | key exchange public key | 32 |
| `t[0]` | view tag | 1 |
| `v ⊕ m_v` | encrypted amount | 8 |
| `n ⊕ m_n` | encrypted nonce | 16 |
| | total | 697 |


The "view tag" is the first byte of the shared secret t. This idea was first proposed for Monero and can reduce the time to test output ownership by at least 15% [10].
4.3.3 Output recognition

Dave can do the following for each unspent output in the blockchain:

1. Derive the nominal shared secret t' = H_d(T_derive, a*K_e).
2. If t'[0] differs from the view tag t[0] stored in the output, this output is not owned by Dave.
3. Calculate the nominal spend key B = K_o - H_s(T_outkey, t')*G.
4. If B is not equal to any of Dave's generated spend keys, this output is not his.
5. Feed the shared secret t' into a stream cipher to derive the blinding factor r and two decryption masks m_v and m_n to decrypt the amount v and nonce n.
6. Check that C_o ?= r*G + v*H.
7. Look up the public view key A associated with B.
8. Calculate Carol's sending key: s = H_s(T_send, A, B, v, n).
9. Check that s*B ?= K_e.

Note that none of these steps requires the private spend key b, so they can be done by an auditor on behalf of Dave. 99.6% of non-owned outputs will abort early in step 2 thanks to the view tag. Steps 7-9 are needed in order to prevent the Janus attack that can be used to link addresses that have the same private view key [11]. If the final check fails, Dave should not confirm the receipt of the payment and should not spend the output.
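Output construction (§4.3.2) and recognition (§4.3.3) can be sketched together to show that Dave recovers exactly what Carol encoded. The sketch rests on several assumptions that are not part of the protocol description: a toy group replaces 𝔾, SHA-256 stands in for H_s and H_d, SHAKE-256 stands in for the stream cipher, the range proof is omitted, and the address is taken directly as B = b*G, A = a*B without the index term of §4.3.1. All function names are hypothetical.

```python
import hashlib

p, q = 2039, 1019  # toy group: order-q subgroup of Z_p^* (insecure)
G, H = 4, 9

def add(P, Q):
    return (P * Q) % p

def neg(P):
    return pow(P, q - 1, p)

def mul(k, P):
    return pow(P, k % q, p)

def H_d(*fields):
    return hashlib.sha256(repr(fields).encode()).digest()

def H_s(*fields):
    return int.from_bytes(H_d("scalar", *fields), "big") % q

def expand(t):
    """Stream-cipher stand-in: derive r, m_v and m_n from the shared secret t."""
    stream = hashlib.shake_256(t).digest(56)
    r = int.from_bytes(stream[:32], "big") % q
    m_v = int.from_bytes(stream[32:40], "big")
    m_n = int.from_bytes(stream[40:56], "big")
    return r, m_v, m_n

def build_output(A, B, v, n):
    """Carol's side (steps 2-5 of output construction)."""
    s = H_s("T_send", A, B, v, n)
    t = H_d("T_derive", mul(s, A))
    r, m_v, m_n = expand(t)
    return {
        "C_o": add(mul(r, G), mul(v, H)),
        "K_o": add(mul(H_s("T_outkey", t), G), B),
        "K_e": mul(s, B),
        "view_tag": t[0],
        "enc_v": v ^ m_v,
        "enc_n": n ^ m_n,
    }

def recognize(out, a, known):
    """Dave's side. `known` maps spend keys B to view keys A. Returns (v, n) or None."""
    t = H_d("T_derive", mul(a, out["K_e"]))                  # step 1
    if t[0] != out["view_tag"]:                               # step 2
        return None
    B = add(out["K_o"], neg(mul(H_s("T_outkey", t), G)))     # step 3
    if B not in known:                                        # step 4
        return None
    r, m_v, m_n = expand(t)                                   # step 5
    v, n = out["enc_v"] ^ m_v, out["enc_n"] ^ m_n
    if out["C_o"] != add(mul(r, G), mul(v, H)):               # step 6
        return None
    A = known[B]                                              # step 7
    s = H_s("T_send", A, B, v, n)                             # step 8
    return (v, n) if mul(s, B) == out["K_e"] else None       # step 9

a, b = 321, 654
B = mul(b, G)
A = mul(a, B)
nonce = 0x0123456789ABCDEF0123456789ABCDEF
output = build_output(A, B, v=1000, n=nonce)
assert recognize(output, a, {B: A}) == (1000, nonce)
```

Recognition works because a*K_e = a*s*B = s*A, so both sides hash the same DH point. The 99.6% figure quoted above is 1 - 1/256, the chance that a random one-byte view tag does not match.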
4.4 Ownership security after cut-through

To achieve an efficient blockchain, we'd like to cut-through spent outputs. However, transaction kernels don't prove ownership anymore because the blinding factors are shared between the sender and the receiver. Therefore, we need to retain some witness data for each spent output so that:

Ownership security can be retrospectively verified.
A payment proof can be constructed even after off-chain cut-through.


Observation 2


For each spent input, the input key K_i and the associated signature must be kept in the blockchain forever.


It would be nice to save blockchain space and put one aggregated signature with all the input keys into the transaction kernel. Unfortunately, this is not possible to do securely without explicitly linking inputs to the kernel (and thus revealing common input ownership). Since any of the K_i values may be rogue keys, the keys and signatures would have to be aggregated in a way that commits to all of the input keys in advance. The reader should refer to the Bellare-Neven multisignature scheme for details [12]. Note that MW doesn't have this limitation because the range proofs show that none of the output commitments is a rogue key, but this cannot be done non-interactively.
The best we can do is to keep K_i and a half-aggregated Schnorr signature separately for each transaction input.
However, this still has 2 problems:

What message should the input keys be used to sign? We cannot sign the outputs, since they are going to be cut-through after being spent (additionally, this would create an explicit link from inputs to outputs). If we sign a static message, this makes the transaction malleable and any malicious network participant can swap the transaction outputs for their own.
How can future verifiers be convinced that the key K_i in the list of spent inputs is indeed the key that was in the cut-through output?

This leads to the following observations:


Observation 3


The signatures with the input keys K_i must be cryptographically bound to the newly created outputs.


Observation 4


To enable cut-through, the signatures with the input keys K_i must be verifiable without access to the original transaction outputs.


We can achieve this without introducing linkability by homomorphically binding the spent inputs to the kernel and the kernel to the transaction outputs.
4.4.1 Homomorphic hash commitments

Given a transaction output out_j, we can define an output identifier ID_j = H_d(T_out, out_j) with the corresponding output key K_j.
We can bind the kernel to all transaction outputs by introducing a new field in the kernel, the output hash-commitment Z:

Z = H_p(ID₁, K₁) + H_p(ID₂, K₂) + ... + o_Z*G

where + is addition in group 𝔾 and o_Z is a random offset. Note that unlike with the sum of ordinary hashes modulo 2²⁵⁶, Wagner's algorithm cannot be used to generate a set of valid outputs given a commitment Z, because it would require solving the discrete logarithm problem in 𝔾 [13].
The homomorphic property of the hash commitment delinks the kernel from the outputs. When two transactions are coinjoined, the offsets o_Z are added together modulo q and it is no longer possible to infer which kernels commit to which outputs, but the sum of the kernels will still commit to all of the outputs. The same idea is used by the Elliptic Curve Multiset Hash [14].
The commitment Z can be signed by the kernel public key K to make transactions non-malleable.
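The delinking effect of the homomorphic hash commitment can be illustrated with a short sketch. The toy group and the hash-to-point construction (hash to a nonzero residue, then square it into the prime-order subgroup) are assumptions chosen for illustration; they are not the H_p a real implementation would use, and the output identifiers and keys below are placeholder values.

```python
import hashlib

p, q = 2039, 1019  # toy group: order-q subgroup of Z_p^* (insecure)
G = 4

def add(P, Q):
    return (P * Q) % p

def mul(k, P):
    return pow(P, k % q, p)

def H_p(*fields):
    """Toy hash-to-point: hash into [1, p-1], then square into the subgroup."""
    digest = hashlib.sha256(repr(fields).encode()).digest()
    x = int.from_bytes(digest, "big") % (p - 1) + 1
    return pow(x, 2, p)

def hash_commitment(outputs, o_Z):
    """Z = H_p(ID_1, K_1) + H_p(ID_2, K_2) + ... + o_Z*G."""
    Z = mul(o_Z, G)
    for output_id, output_key in outputs:
        Z = add(Z, H_p(output_id, output_key))
    return Z

tx1_outputs = [("id-1", 11), ("id-2", 22)]
tx2_outputs = [("id-3", 33)]
Z1 = hash_commitment(tx1_outputs, o_Z=100)
Z2 = hash_commitment(tx2_outputs, o_Z=200)

# After a coinjoin, the summed kernels commit to the union of the outputs
# under the summed offset, without revealing which kernel covers which output.
assert add(Z1, Z2) == hash_commitment(tx1_outputs + tx2_outputs, o_Z=300)
```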
4.4.2 Binding transaction inputs to the kernel

In MW, the kernel public key is defined as:

K_MW = C_o1 + C_o2 + ... - C_i1 - C_i2 - ... - o_K*G

where C_o are the output commitments, C_i the input commitments and o_K an offset.
In MJ, we have the kernel and a set of input Schnorr signatures (R_i1, s_i1), (R_i2, s_i2), ... with the input public keys K₁, K₂, ...
We can bind the inputs to the kernel by defining the kernel public key as:

K_MJ = K_MW - R_i1 - R_i2 - ...

Carol can still sign with this public key because she knows the private keys of all of the nonces R_i. The balance equation still holds because all of the nonces have the form R_i = r_i*G, which is proved by the input Schnorr signatures.
4.4.3 Verifiable input signatures

We can achieve verifiability by defining a spent input as a tuple (ID_i, K_i, R_i, s_i), where (R_i, s_i) is a Schnorr signature of ID_i with the input key K_i. The s_i values can be aggregated on a per-block basis, so each input will asymptotically require only 96 bytes of data to be kept in the blockchain.
Thanks to the homomorphic output commitment Z in each transaction kernel, verifiers can be convinced that all spent inputs correspond to previously created outputs by checking:

Σ_KERN Z_k ?= Σ_TXI H_p(ID_i, K_i) + Σ_UTXO H_p(ID_o, K_o) + o_Z*G

where Σ_KERN is a sum over all transaction kernels, Σ_TXI sum over all transaction inputs and Σ_UTXO sum over all unspent transaction outputs. Notice that checking this equation doesn't require the access to spent historical outputs.
4.5 Minglejingle transaction

An MJ transaction consists of:

A set of inputs, with each input being the tuple (ID_i, K_i, R_i, s_i), where ID_i may refer to the hash of an unspent output in the blockchain.
A transaction kernel consisting of the kernel public key K, the hash commitment Z and a signature (R_K, s_K) of the value Z.
One or more transaction outputs. The output format is described in §4.3.2.
Two offsets o_K and o_Z.

The two offsets and the values of s_i can be aggregated for the whole block, so the size of a 2-in 2-out transaction is 1714 bytes. A spent transaction will leave 320 bytes in the blockchain (128 bytes for the kernel and 96 bytes per input).
4.6 Validation rules

In order to verify the whole transaction history, a node needs:

All block headers.
The total money supply μ. This can be implicit from the block height or derived from the block headers.
The list of spent inputs (ID_i, K_i, R_i) for each block.
All transaction kernels (K, Z, R_K, s_K)
All unspent outputs.

Unlike in Bitcoin, spent outputs are not needed for validation.
The five validation rules are the following:

1. The aggregated signature s_agg in each block is valid for all the spent inputs in that block.
2. All kernel signatures are valid.
3. The supply balance holds: Σ_UTXO C_o ?= μ*H + Σ_KERN K_k + Σ_TXI R_i + o_K*G
4. The input-output balance holds: Σ_KERN Z_k ?= Σ_TXI H_p(ID_i, K_i) + Σ_UTXO H_p(ID_o, K_o) + o_Z*G
5. All unspent outputs have valid range proofs.

Similar rules apply for mempool transaction validation, except that the input signatures are not aggregated yet and the μ*H term is replaced with the sum of the spent input commitments Σ_TXI C_i. The values of C_i can be looked up in the list of UTXOs based on ID_i. Note that some values of ID_i may refer to non-existing inputs, which indicates that off-chain cut-through has occurred. Such inputs count as having a value of 0 (but they still must have valid signatures).
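A single-transaction (mempool) validation can be sketched as below for one input and two outputs. Several simplifications are assumptions of this sketch rather than statements of the protocol: the toy group and SHA-256 based hashes, fixed scalars and nonces for reproducibility, placeholder output identifiers and output keys, no range proofs, and a per-transaction reading of the input-output balance in which the kernel's Z must equal the hash commitment of the transaction's own outputs.

```python
import hashlib

p, q = 2039, 1019  # toy group: order-q subgroup of Z_p^* (insecure)
G, H = 4, 9

def add(*points):
    total = 1
    for P in points:
        total = (total * P) % p
    return total

def neg(P):
    return pow(P, q - 1, p)

def mul(k, P):
    return pow(P, k % q, p)

def commit(r, v):
    return add(mul(r, G), mul(v, H))

def _digest(tag, fields):
    return hashlib.sha256(repr((tag, fields)).encode()).digest()

def H_s(*fields):
    return int.from_bytes(_digest("scalar", fields), "big") % q

def H_p(*fields):
    x = int.from_bytes(_digest("point", fields), "big") % (p - 1) + 1
    return pow(x, 2, p)

def sign(x, m, nonce):
    P, R = mul(x, G), mul(nonce, G)
    return R, (nonce + x * H_s(R, P, m)) % q

def verify(P, m, R, s):
    return mul(s, G) == add(R, mul(H_s(R, P, m), P))

# Spent input: an existing UTXO worth 50 with output key K_in.
k_in, r_in, input_nonce = 101, 202, 303
ID_in, K_in, C_in = "utxo-1", mul(k_in, G), commit(r_in, 50)
R_i, s_i = sign(k_in, ID_in, input_nonce)

# New outputs worth 30 and 20 as (ID, output key, commitment).
r_o1, r_o2 = 404, 505
outputs = [("out-1", mul(606, G), commit(r_o1, 30)),
           ("out-2", mul(707, G), commit(r_o2, 20))]

# Kernel: K_MJ = sum(C_o) - sum(C_i) - o_K*G - sum(R_i), signed over Z.
o_K, o_Z = 808, 909
k_kernel = (r_o1 + r_o2 - r_in - o_K - input_nonce) % q
K = mul(k_kernel, G)
Z = add(mul(o_Z, G), *(H_p(ID, K_o) for ID, K_o, _ in outputs))
R_K, s_K = sign(k_kernel, Z, nonce=111)

def validate():
    input_ok = verify(K_in, ID_in, R_i, s_i)
    kernel_ok = verify(K, Z, R_K, s_K)
    supply_ok = add(*(C for _, _, C in outputs)) == add(C_in, K, R_i, mul(o_K, G))
    binding_ok = Z == add(mul(o_Z, G), *(H_p(ID, K_o) for ID, K_o, _ in outputs))
    return input_ok and kernel_ok and supply_ok and binding_ok

assert validate()
```

Carol can sign with K because she knows every scalar in k_kernel, including the input nonce; a verifier only needs the public values to check the same relations.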
4.7 Payment proof

To prove that she sent money to Dave, Carol can provide an arbiter with:

Dave's address (A, B)
The transaction amount v
The one-time nonce n

The arbiter can verify Carol's claim by:

Calculating the sending key s = H_s(T_send, A, B, v, n)
Deriving the shared secret t = H_d(T_derive, s*A).
Constructing the corresponding one-time public key for Dave: K_d = H_s(T_outkey, t)*G + B

The arbiter can then check if an input or an output exists in the blockchain with the key K_d.
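The arbiter's computation can be sketched as follows. The toy group, the SHA-256 stand-ins for H_s and H_d, the function names and the representation of the blockchain as two sets of keys are all assumptions made for illustration.

```python
import hashlib

p, q = 2039, 1019  # toy group: order-q subgroup of Z_p^* (insecure)
G = 4

def add(P, Q):
    return (P * Q) % p

def mul(k, P):
    return pow(P, k % q, p)

def H_d(*fields):
    return hashlib.sha256(repr(fields).encode()).digest()

def H_s(*fields):
    return int.from_bytes(H_d("scalar", *fields), "big") % q

def expected_output_key(A, B, v, n):
    """Recompute K_d from the payment proof (A, B, v, n)."""
    s = H_s("T_send", A, B, v, n)
    t = H_d("T_derive", mul(s, A))
    return add(mul(H_s("T_outkey", t), G), B)

def check_payment_proof(A, B, v, n, spent_input_keys, unspent_output_keys):
    K_d = expected_output_key(A, B, v, n)
    if K_d in spent_input_keys:
        return "spent input found"
    if K_d in unspent_output_keys:
        return "unspent output found, validate its fields"
    return "no matching key"

# Dave's address and Carol's claimed payment (demo values).
B = mul(654, G)
A = mul(321, B)
K_d = expected_output_key(A, B, v=1000, n=7)
print(check_payment_proof(A, B, 1000, 7, spent_input_keys={K_d}, unspent_output_keys=set()))
```

The two positive outcomes correspond to the two cases discussed next: a spent input settles the claim immediately, while an unspent output still has to be checked field by field.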
4.7.1 A spent input exists

If a spent input exists with this key, the arbiter is convinced that the payment has taken place. Dave's signature with the key K_d indisputably proves that Carol's output was correctly formed.
4.7.2 An unspent output exists

If an unspent output exists with the output key K_d, additional checks are needed to ensure that the output is correctly formed. The arbiter can follow the output construction procedure from §4.3.2 and validate the output. If the output is malformed (e.g. the amount commitment doesn't have the correct value, the view tag or the encrypted nonce are invalid etc.), blame can be assigned to Carol.
5. Security analysis

5.1 Linking addresses to a wallet

Given two addresses (A_i, B_i), (A_j, B_j) generated according to §4.3.1, deciding whether they belong to the same wallet requires solving the decisional Diffie-Hellman problem in 𝔾 [15]. This problem is considered intractable if the used elliptic curve has a large embedding degree, as is the case for most non-pairing-friendly curves such as ed25519.
An active attack may be attempted by Mallory if she suspects that (A_i, B_i), (A_j, B_j) are both owned by Dave. This requires her to deviate from the output construction process described in §4.3.2 by calculating the shared secret as t = H_d(T_derive, s*A_i), setting the key exchange public key to K_e = s*B_i and the output key to K_d = H_s(T_outkey, t)*G + B_j. However, Mallory won't be able to provide a nonce value n that passes the check in step 9 of §4.3.3, so Dave will be able to detect this attack and won't confirm the payment to Mallory. Mallory will not be able to provide a payment proof because she in fact created an invalid output to the address (A_i, B_j).
5.2 Linking outputs to an address

Linking an output to an address requires either solving the discrete logarithm problem by calculating the receiver's private view key a or by guessing the sender's nonce value n (assuming the amount is known to the attacker). Both of these problems are intractable.
5.3 Breaking supply security

There are three approaches to breaking the supply balance equation (rule 3 of §4.6):

1. Creating a valid Schnorr signature using H as the public key.
2. Creating a valid Schnorr signature using H as the challenge nonce.
3. Creating a valid range proof for a negative value.

Approaches 1 and 2 require solving the DLP in 𝔾, which is intractable. The third approach is intractable if the range proof is sound.
5.4 Breaking ownership security

There are three approaches for stealing an unspent output (ID_o, K_o), assuming the attacker knows the blinding factor of the output amount C_o:

1. Forging a signature with the key K_o.
2. Calculating the discrete logarithm of H_p(ID_o, K_o), replacing the output with (ID_attack, K_attack), adjusting the blockchain offset o_Z and performing a 51% attack to reorganize the blockchain beyond the cut-through horizon.
3. Generating an output (ID_attack, K_attack) such that H_p(ID_attack, K_attack) = H_p(ID_o, K_o), replacing the original output and performing a 51% attack to reorganize the blockchain beyond the cut-through horizon.

All of these approaches involve solving intractable problems (the second and third are also impractical).
5.5 Replay attacks

MJ is susceptible to the following replay attack:

1. Mallory sends an output (ID₁, K₁) to Dave.
2. Some time later, Dave spends the output in transaction TX2.
3. Mallory again sends an identical output (ID₁, K₁) to Dave. This requires Mallory to use the same sender nonce and amount as before.
4. Anyone can replay the transaction TX2 to spend this output the same way Dave spent the first copy of the output.

A similar attack applies to Mimblewimble [16]. Solving this attack may not require consensus changes because Dave will be able to detect the duplicate output and won't provide any goods or services to Mallory for the payment. If Mallory tries to dispute the payment, the arbiter will clearly see the duplicate output keys in the blockchain, which can only occur in one of two cases (besides a negligible chance of a collision):

1. Mallory deliberately sent the duplicate payment.
2. Mallory is a victim of a longer chain of replayed transactions.

In the first case, the fault lies with Mallory. In the second case, someone else caused the replay attack, in which case Mallory actually didn't make the payment. In both cases, the arbiter may conclude that Dave is not obligated to provide any goods or services to Mallory.
5.6 Forging payment proofs

Given an output key K_o in the blockchain, Mallory can try to forge a payment proof with a value of v to the address (A,B):

If Mallory knows the private keys for K_o and B, she can forge the proof by calculating a hash preimage of k_o - b, another hash preimage to get the shared key, solving for the discrete logarithm of the Diffie-Hellman point with respect to A and finally, the last hash preimage to get the sending nonce n.
If Mallory doesn't know the private key for K_o or B, forging the proof requires one additional discrete logarithm calculation.

In both cases, forging the proof involves solving intractable problems.
6. Efficiency and usability analysis

6.1 Initial blockchain download size

The table below lists the initial blockchain download (IBD) size of MJ in comparison with other protocols at the 128-bit security level. This comparison assumes a blockchain with 600 million transactions with 2 inputs and 2 outputs each and 70 million unspent transaction outputs. For simplicity, we only count bare transaction sizes without block headers, transaction headers, explicit fees, lock times and aggregatable fields.


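| protocol | 2-2 TX size (bytes) | STXO size (bytes) | IBD size (GB) | IBD size (visual) |
|---|---|---|---|---|
| Mimblewimble | 1376 | -640 | 100 | `**` |
| Bitcoin | 280 | | 168 | `***` |
| Minglejingle | 1714 | -697 | 241 | `*****` |
| Bitcoin with CT | 1016 | | 610 | `************` |
| Monero | 1800 | | 1080 | `**********************` |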
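
The IBD figures in the table can be reproduced with simple arithmetic. The model below is an assumption inferred from the numbers rather than stated in the text: protocols without cut-through keep every full transaction, MW keeps one 96-byte kernel per transaction plus 608 bytes (commitment and range proof) per unspent output, and MJ keeps 320 bytes per transaction (§4.5) plus 697 bytes per unspent output.

```python
TRANSACTIONS = 600_000_000
UNSPENT_OUTPUTS = 70_000_000

# (bytes kept per historical transaction, bytes kept per unspent output)
protocols = {
    "Mimblewimble": (96, 32 + 576),        # kernel only; UTXO = commitment + range proof
    "Bitcoin": (280, 0),                   # full transaction kept forever
    "Minglejingle": (128 + 2 * 96, 697),   # kernel + two spent inputs; UTXO per §4.3.2
    "Bitcoin with CT": (1016, 0),
    "Monero": (1800, 0),
}

def ibd_size_gb(bytes_per_tx, bytes_per_utxo):
    total = TRANSACTIONS * bytes_per_tx + UNSPENT_OUTPUTS * bytes_per_utxo
    return total / 1e9

ibd_sizes = {name: round(ibd_size_gb(*cost)) for name, cost in protocols.items()}
for name, size in ibd_sizes.items():
    print(f"{name}: {size} GB")
```

Under this model the MJ to MW ratio is about 2.4, consistent with the "2-3 times larger" estimate given in the introduction.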


6.2 Verification performance

The following table lists the number of Schnorr signatures and BP+ range proofs (in millions) that need to be verified for full IBD validation. Note that performance-wise, a range proof verification is equivalent to about 100 signatures. Protocols are ordered in ascending order of computational requirements.


| protocol | signatures (millions) | range proofs (millions) | total CPU (visual) |
|---|---|---|---|
| Bitcoin | 600 | 0 | `.` |
| Bitcoin with CT | 600 | <70¹ | `*****` |
| Mimblewimble | 600 | 70 | `******` |
| Minglejingle | 1800 | 70 | `*******` |
| Monero | 48000² | 600 | `*************************************************`...`*************` |

¹ Range proofs are aggregated and it's only necessary to verify the range proof of transactions with at least one unspent output.
² Assuming CLSAG with a ring size of 11 counting as 40 Schnorr signatures based on available benchmarks.
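The counts in the table can be combined into a single signature-equivalent cost using the factors quoted above (about 100 signatures per range proof, 40 per CLSAG). This is a rough sketch: the MJ and Monero signature counts are derived here from 2 inputs per transaction, which is an assumption consistent with the table, and the Bitcoin with CT figure uses the upper bound of 70 million range proofs.

```python
TX_MILLIONS = 600
UTXO_MILLIONS = 70
RANGE_PROOF_COST = 100  # signature verifications per range proof (figure from the text)
CLSAG_COST = 40         # signature verifications per CLSAG (footnote 2)

signatures = {
    "Bitcoin": TX_MILLIONS,
    "Bitcoin with CT": TX_MILLIONS,
    "Mimblewimble": TX_MILLIONS,              # one kernel signature per transaction
    "Minglejingle": TX_MILLIONS * (2 + 1),    # two input signatures plus one kernel
    "Monero": TX_MILLIONS * 2 * CLSAG_COST,   # one CLSAG per input
}
range_proofs = {
    "Bitcoin": 0,
    "Bitcoin with CT": UTXO_MILLIONS,         # upper bound, see footnote 1
    "Mimblewimble": UTXO_MILLIONS,
    "Minglejingle": UTXO_MILLIONS,
    "Monero": TX_MILLIONS,
}

total_cost = {
    name: signatures[name] + RANGE_PROOF_COST * range_proofs[name]
    for name in signatures
}
for name, cost in total_cost.items():
    print(f"{name}: {cost} million signature-equivalents")
```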
6.3 Usability and privacy


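| protocol | NITX | NICJ | CT | BLINK | TLINK |
|---|---|---|---|---|---|
| Bitcoin | X | | | | |
| Mimblewimble | | X | X | X | |
| Minglejingle | X | X | X | X | |
| Bitcoin with CT | X | | X | | |
| Monero | X | | X | X | X |
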

NITX: non-interactive transactions
NICJ: non-interactive coinjoin
CT: hidden transaction amounts
BLINK: blocks don't link inputs to outputs
TLINK: transactions don't link inputs to outputs

References

[1] Mimblewimble: https://github.com/mimblewimble/docs/wiki/MimbleWimble-Origin
[2] MW one-sided transactions: https://github.com/mimblewimble/grin-rfcs/blob/a2d9f25cdccf29904ef241df5b608ca3fd16ed61/text/0000-onesided.md
[3] One-sided transactions in MW: https://github.com/DavidBurkett/lips/blob/master/lip-0004.mediawiki
[4] MW non-interactive transactions: https://eprint.iacr.org/2020/1064
[5] Bulletproofs+: https://eprint.iacr.org/2020/735
[6] Diffie-Hellman: https://www.cs.utexas.edu/~shmat/courses/cs380s/dh.pdf
[7] Schnorr signatures: https://eprint.iacr.org/2012/029.pdf
[8] Ristretto: https://ristretto.group/what_is_ristretto.html
[9] Monero subaddresses: https://monerodocs.org/public-address/subaddress/
[10] Monero view tags: monero-project/research-lab#73
[11] Janus attack: https://web.getmonero.org/2019/10/18/subaddress-janus.html
[12] Multisignatures: https://cseweb.ucsd.edu/~mihir/papers/multisignatures-ccs.pdf
[13] Wagner's algorithm: https://www.math.uni-frankfurt.de/~dmst/teaching/SS2017/wagner.pdf
[14] Elliptic Curve Multiset Hash: https://arxiv.org/abs/1601.06502
[15] Decisional Diffie-Hellman: https://en.wikipedia.org/wiki/Decisional_Diffie%E2%80%93Hellman_assumption
[16] Grin replay attack: https://forum.grin.mw/t/enforcing-that-all-kernels-are-different-at-consensus-level/7368