Elliptic Curve Digital Signature Algorithm: Math Behind Bitcoin | Subir Chowdhuri

Published:: Jun 27, 2017

Last updated:: Jul 11, 2017

Most Bitcoin implementations skip over the mathematical details - for good reason! The gory details are not for the faint of heart! However, those who dare wade through the myriad equations, gain unparalleled clarity of the underlying concepts. I myself emerged on the other side, marveling at the beauty of the math behind Bitcoin; fascinated by the elegance of the implementation. I wonder why schools never discuss real-world use-cases such as this.

Throughout this article, I will be assuming basic understanding of Bitcoin and the blockchain. If you are new to Bitcoin or the concept of a blockchain, you should rather start off with few of the several non-mathematical treatments available online.

Ownership of Bitcoin

Bitcoins only exist as “transactions” (transfer of value from one address to another) in the blockchain. Each transaction can be traced back to one or more transactions where the bitcoins originated - generated by the system. To own a certain amount of bitcoins means possessing the ability to be able to transfer ownership (and hence, the bitcoins themselves) to another address. The sender must create a “digitally signed” transaction, which, if verified by others (the entire network can participate in this process), is added to the blockchain for all of eternity. This is where ECDSA comes in.

ECDSA

Elliptic Curve Digital Signature Algorithm is used to create digital signatures for data. A digital signature is similar to its real world counterpart - easy to read and identify the owner, but impossible to forge.

ECDSA is based on the mathematical concepts of Elliptic Curves and Finite Fields.

Elliptic Curve

An elliptic curve is of the form:

y^2 = x^3 + ax + b

Examples:

Bitcoin sets a=0, b=7. Here is how Bitcoin's elliptic curve looks like:

Properties of the Elliptic Curve

If a line intersects two points on the curve, it will intersect exactly one more point
A line tangent to the curve will intersect exactly one point on the curve

What about vertical lines (slope = INF)? Note that the slope of the elliptic curve goes on increasing beyond the point of inflection (the point where the direction of curvature changes). It can be implied from this that the slope becomes INF at some point. Hence, it can be said that a vertical line intersects the curve at INF.

Point Addition

Adding two points on the curve, ie: P1 + P2 = P3

If the line P1 P2 intersects the curve at point P3’ (from property #1 above), then P3 is the reflection of P3’ on the x-axis:

The line P1P2 is described by:

Where, the slope of the line is:

Plugging the value of y in the elliptic curve equation:

We arrive at a polynomial equation of the third degree. And,

With x3 known, we need y3:

Note: The point P3(x3, y3) is actually the reflection of P3’(x3, y3’)

Point Doubling

Adding the same point to itself: P1 + P1 = P2

The tangent at point P1 intersects the curve at a point P2’ (from property #2). The reflection of point P2’ on the x-axis gives us P2:

We can use the results from point addition:

The slope of the tangent can be found by differentiation:

Hat-tip to Baidyanath for helping me understand this part!

Scalar Multiplication

Multiplying a point P by a number k:

R = k * P

This can be broken down into a point doubling and addition steps. Example:

3 * P
= P + P + P
= P + (P + P)

3P was thus broken down into one point-addition and one point-doubling step.

Elliptic Curve in Finite Fields

A finite field is a set of finite elements. To define our elliptic curve over a finite field:

Limit values to integers instead of real numbers
Compute all results modulo p; where p is a prime number
Follow modular arithmetic for all operations

So, if our elliptic curve is

y^2 = x^3 + 10x

In context of a finite field, with p = 67, it becomes:

y^2 = (x^3 + 10x) mod 67

The plot of the curve changes significantly, but the characteristic properties are retained:

Note the unmistakable symmetry with respect to y=p/2. What previously was a continuous curve is now a set of disjoint points in the xy-plane

Parameters of the Elliptic Curve

The elliptic curve is described by certain parameters:

the elliptic curve equation, defined by the coefficients (a, b)
prime modulo p
base point

Bitcoin uses Secp256k1 which sets a=0 and b=7 to give us the equation:

y^2 = x^3 + 7

Prime modulo

  2^256 – 2^32 – 2^9 – 2^8 – 2^7 – 2^6 – 2^4 - 1
  = FFFFFFFF FFFFFFFF FFFFFFFF FFFFFFFF FFFFFFFF FFFFFFFF FFFFFFFE FFFFFC2F

Point of Origin (G)

  04 79BE667E F9DCBBAC 55A06295 CE870B07 029BFCDB 2DCE28D9 59F2815B 16F81798
  483ADA77 26A3C465 5DA4FBFC 0E1108A8 FD17B448 A6855419 9C47D08F FB10D4B8

Creating the Private and Public Key pair

The private key is a randomly chosen 20-byte number. The public key is a point on the bitcoin elliptic curve generated from the point multiplication of G with the private key.

K(pub) = K(pvt) * G

Signing a message

Select a random number k such that 1 <= k <= n-1
Calculate the point P = k * G. That point's x value will represent R
Calculate S like so:

S = k^-1 (z + Kpvt * R) mod p

Where z is the SHA256 hash of the message, p is the prime modulo, Kpvt is the private key and k is the random number from step 1 above. k^-1 is the modular multiplicative inverse of k

The pair (R, S) together form the digital signature.

Verifying a signature

To verify a digital signature (R, S), you need the public key and the curve parameters. Plug the values in this equation:

P =  S^-1 * z * G + S^-1 * R * Kpub

If the signature is valid, the x coordinate of point P will be equal to R.