ZKM

Mar 10, 2025

1. Prerequisite Knowledge

1.1 Concepts
1.2 Circuits and Gates

2. In-Depth Analysis of zkVM

2.1 Overview
2.2 Virtual Machine (VM)

2.2.1 Two Main Architectures
2.2.2 RISC Instruction Set Architecture (ISA)

2.3 Implementing a Simple zkVM: Fibonacci Executor

2.3.1 Generating the Execution Trace
2.3.2 Building Polynomial Equations
2.3.3 Introducing Cyclicity
2.3.4 Polynomial Commitment

3. General zkVM Design Concepts

4. General Techniques

4.1 Host and Submachine
4.2 Lookup Scheme
4.3 Lookup Argument Schemes

4.3.1 Plookup
4.3.2 Caulk
4.3.3 Caulk+
4.3.4 Baloo
4.3.5 Flookup
4.3.6 cq
4.3.7 LogUp

4.4 Proof Aggregation and Recursion
4.5 Continuation Mechanism
4.6 zkVM Examples
4.7 zkVM Evaluation

4.7.1 Efficiency
4.7.2 Developer Toolchain

4.8 Application Scenarios

zkVM (Zero-Knowledge Virtual Machine) is a virtual machine that utilizes Zero-Knowledge Proofs (ZKP) to ensure the correctness, integrity, and privacy of computations.

1. Prerequisite Knowledge

1.1 Concepts

Zero-Knowledge Proof: A proof where the prover demonstrates to the verifier that a statement is true, without revealing any information beyond the truth of the statement. It includes the following four properties:

Zero-Knowledge
Succinctness
Non-Interactivity
Transparency

Validity Proof:

Used to prove the validity of state transitions. For example, zk-Rollups use validity proofs to demonstrate the legality of state transitions to the parent chain, typically in combination with proof systems like SNARKs and STARKs.

Fraud Proof:

Used to challenge the validity of transactions by the verifier.

Verifiable Computation:

Allows clients to outsource computation tasks of a function F to untrusted workers and verify the correctness of the results returned.

1.2 Circuits and Gates

When writing a ZKP application, the following three steps are involved:

Define the problem (constraint satisfaction problem) using a language

Here is an example of how to implement a multiplier using Circom.

Multiplier2.circom:

pragma circom 2.0.0;
/* This circuit template checks if c is the product of a and b. */
template Multiplier2 () {
   // Signal declarations
   signal input a;
   signal input b;
   signal output c;
   // Constraints
   c <== a * b;
}

The flowchart is as follows:

Arithmetization: Converting a program into a set of polynomials

Circuit Computation:
This method reduces the program to the gate-level concept. However, the gates here are not physical gates in a processor, but logical concept gates.‍

R1CS: The maximum polynomial degree is 2, satisfying the following equation:
\[\left(\sum_{k}A_{ik}Z_{k}\right)\left(\sum_{k}B_{ik}Z_{k}\right)-\left(\sum_{k}C_{ik}Z_{k}\right)=0\]

Where \(A_{ik'}, B_{ik'}, C_{ik}\) are elements from a finite field F. For example, the expression \(y = x^3\) can be represented by introducing intermediate variables as follows:
\[x \cdot x = w_1\]\[w_1 \cdot w_1 = w\]

Intermediate variables can be private or public inputs of the circuit. It is important to note that the R1CS representation of a computation is not unique. For example, the above expression could also be represented as:
\[x \cdot x = w_1\]\[w_1 \cdot x = w_2\]\[w_2 \cdot x = w_3\]‍

Plonkish: Supports two-input gates (two fan-in gates), allowing for the implementation of custom gates.
\[q_{L}x_{a}+q_{R}x_{b}+q_{o}x_{c}+q_{M}x_{a}x_{b}+q_{c}=0\]
Where \(q_{L}, q_{R}, q_{O}, q_{M}, q_{C}\) are control parameters for the selection operations, used to represent R1CS operations.

Computational Computation:
This method is similar to how computers work, involving a set of registers and state transition functions that modify the register values.

AIR and PAIR: Suitable for uniform computations.

CCS (Customizable Constraint System): CCS (Customizable Constraint System) is a general constraint representation method that supports R1CS, Plonkish, and AIR.
Simple Comparison

Define the Prover Instance, Generate Proof and Verifier
‍
Interactive Oracle Proof (IOP): IOP is an interactive proof protocol where the verifier does not need to read the entire message from the prover but can instead access any necessary symbols through an oracle. This allows the verifier’s runtime to be shorter than the total length of the proof (i.e., the sum of all message lengths).
‍
Polynomial Commitment Scheme: A cryptographic protocol used to commit to a polynomial and later verify the evaluation at specific points, preventing the prover from sending the entire witness w to the verifier.
‍
The general interactive protocol flow is as follows: 1. The prover commits to one or more polynomials using the specified polynomial commitment scheme. 2. The verifier randomly selects one or more field elements, sends them as challenges to the prover, and asks the prover to provide the evaluation values of the committed polynomials at these randomly chosen points (called the opening values). 3. This “question-and-answer” interaction can be repeated according to the number of openings required by the verifier, ensuring the reliability of the proof. 4. The verifier uses the relevant polynomial constraints to test the validity of the prover’s opening values.

The Fiat-Shamir method can be used to convert the interactive protocol into a non-interactive one.

KZG Commitment Scheme:

Source: https://youtu.be/xuGQYEvytxk?t=640

FRI Commitment Scheme (e.g., eSTARK):

Commitment Phase

The prover \(P\) submits a polynomial \(p_0\) over a multiplicative subgroup \(H\). The degree of the polynomial is \(d\), and all elements come from the field \(F\), where \(G\) is the generator of field \(F\), and \(K\) is the extension field of \(H\).

MTR is the root node of the Merkle tree. The verifier randomly generates a value and sends it to the prover, who then submits the MTR to the verifier as the opening value.

Proof Systems:‍

Groth16: e.g., rapidsnark, libsnark, arkworks
‍Plonk: e.g., Bellman, Halo2
‍EthStark

Publish Verifier on-chain:

Choose the appropriate elliptic curve on-chain: e.g., BN128, BLS12381

Question: Can we use high-level languages (such as Golang, Rust) to define problem models?

2. In-Depth Analysis of zkVM

2.1 Overview

2.2 Virtual Machine (VM)

In computing, a virtual machine (VM) is a virtualization or simulation of a computer system. A virtual machine is based on computer architecture and provides functionality similar to a physical computer.

Computer Architecture refers to the structure of a computer system composed of components like CPU, memory, ALU, etc.

2.2.1 Two Main Architectures

CPU
Memory
Registers
Bus
Input/Output Units

2.2.2 RISC Instruction Set Architecture (ISA)

RISC (Reduced Instruction Set Computer) is designed to simplify the instruction set used by computers when performing tasks.

Below is an example of the ADDI instruction in MIPS32, which allows for the selection of a source register, a target register, and includes a small constant.

In zkVM, there are three main RISC ISAs:

MIPS
RISC V
WASM

Among them, RISC V is similar to a simplified version of MIPS.

Source: Page 322, *Computer Organization and Design: The Hardware/Software Interface — RISC-V Edition*

2.3 Implementing a Simple zkVM: Fibonacci Executor

2.3.1 Generating the Execution Trace

Based on existing knowledge, we can implement a simple zkVM, such as a Fibonacci executor, to calculate the n-th term of the Fibonacci sequence.

\[f(n) = f(n-1) + f(n-2) \quad f(0) = f(1) = 1\]

We choose AIR (Algebraic Intermediate Representation) as the algebraization method.

Define a state machine \(S\) containing the following two variables, and let \(i\) be the field size:

\[S = (A_i, B_i) \quad S' = (A_{i+1}, B_{i+1})\]

Now, the state machine S can represent the Fibonacci executor as follows:

\[A_{i+1} = B_i \quad B_{i+1} = A_i + B_i\]

Thus, for all \(i\) in the field, the execution trace table is as follows, where \(n = 6\):

2.3.2 Building Polynomial Equations

In the virtual machine, each state is represented by a register.

The polynomials representing two registers come from the polynomial set \(F_p[X]\), where the coefficients are elements from the prime field \(F_p\) and \(p = 2^{64} - 2^{32} + 1\)

Thus, the domain forms a subgroup:

\[H = \{\omega_0, \omega_1, \omega_2, ..., \omega_d = \omega_0\} \subset F_p^d\]

Define two polynomials \(P(X)\) and \(Q(X)\) to represent the trace columns A and B:

\[P(\omega^{i})=A_{i}Q(\omega^{i})=B_{i}\]

It is clear that:

\[P(\omega^{i}\cdot\omega)=P(\omega^{i+1})=A_{i+1}\]

\[Q(\omega^{i}\cdot\omega)=Q(\omega^{i+1})=B_{i+1}\]

Using Lagrange interpolation, we can compute the coefficient form of \(P\) and \(Q\).

Now, we can use polynomial constraints to limit the state transitions with two registers:

\[P(X \cdot \omega) = I_H Q(X)\]

\[Q(X \cdot \omega) = I_H P(X) + Q(X)\]

2.3.3 Introducing Cyclicity

Since \(H\) is a subgroup, we have:

\[P(\omega^{5}\cdot\omega)=I_{H}Q(\omega^{5})=13\]

\[P(\omega^{5}\cdot\omega)=P(\omega^{0})=1\]

Clearly, the constraint is broken in the last row.

The solution is to introduce another register LAST to mark the last row and assign it a vector value \([0, ..., 1]\). The trace table becomes:

\[P(X \cdot \omega) = I_H Q(X) \cdot (1 - \text{LAST}(X)) + \text{LAST}(X) \cdot P(\omega^0)\]

\[Q(X \cdot \omega) = I_H (P(X) + Q(X)) \cdot (1 - \text{LAST}(X)) + \text{LAST}(X)\]

In summary, there are two basic types of constraints in the state machine:

State Transition Constraints: For registers like \(P\) and \(Q\).
Boundary Constraints: For registers like LAST.

In actual zkVM development, there are various types of instructions, such as arithmetic, logical, branching/jumping, and memory operations. Each instruction requires only a small amount of state to represent, but a program may require thousands of states.

Below is an overview of the zkMIPS implementation:

2.3.4 Polynomial Commitment

Now, the constraints mentioned above can be transformed into polynomial equalities:

\[P_0 = (1 - \text{LAST}(X)) \cdot (P(X \cdot \omega) - Q(X)) = 0\]

\[P_1 = (1 - \text{LAST}(X)) \cdot (Q(X \cdot \omega) - (P(X) + Q(X))) = 0\]

The proof system based on polynomial equalities utilizes the fundamental properties of the Schwartz-Zippel Lemma.

According to the Schwartz-Zippel Lemma, the probability that the prover finds a false polynomial Q' (with degree d) that satisfies:

\[Q'(\alpha_j) = Q(\alpha_j) \quad \text{for all } j \in \{1, 2, ..., l\}\]

for random challenges \(\{\alpha_1, \alpha_2, ..., \alpha_i\}\) from the verifier is at most \(\frac{d}{|S|}\), which is negligible.

Define \(z_H(X)\) as the vanishing polynomial, and compute the quotient polynomials \(q_i(X)\):

\[P_0 = (1 - \text{LAST}(X)) \cdot (P(X \cdot \omega) - Q(X)) = z_H(X) \cdot q_i(X)\]

\[P_1 = (1 - \text{LAST}(X)) \cdot (Q(X \cdot \omega) - (P(X) + Q(X)))\]

Therefore, we can use polynomial commitment schemes (such as FRI or KZG) to commit to the trace and quotient polynomials \(R_i, q_i(X)\).

3. General zkVM Design Concepts

Generally, there are three main types of zkVMs:

4. General Techniques

4.1 Host and Submachine

In modern Instruction Set Architectures (ISA), virtual machines consist of various components, such as CPU, ALU, memory, I/O, bus, and other peripherals.
Different components handle different instructions and communicate via the bus.

Taking zkMIPS as an example, we have implemented 62 instructions and categorized them as follows:

Arithmetic and Arithmetic Immediate: such as ADD, ADDI, etc.
Logical and Logical Immediate: such as AND, ANDI, etc.
Shift and Shift Immediate: such as SLL, SLLV, etc.
Load and Store: such as LB, SB, etc.
Jump/Offset Jump/Branch: such as JUMP, JUMPI, etc.
System Calls: such as Read, Write, etc.

Thus, if we use a single trace table and place all state variables in it, the table becomes very large and sparse (i.e., many zero-value entries). This leads to the following issues:

A very large number of columns ⇒ Many trace polynomials to commit to
A very large number of rows ⇒ Very large domain size, and high degrees for the trace polynomials

Both of these points lead to a substantial computational load in the Polynomial Commitment Scheme (PCS).

A common optimization method is to split the large table into multiple sub-tables, as shown below:

This splitting brings additional engineering benefits:

Reduced memory and CPU consumption
Support for parallel proofs
Modularity: Different engineers can focus on the implementation and optimization of different circuits

4.2 Lookup Scheme

Lookup arguments allow proof that the vector elements submitted by the prover come from another (larger) committed table. Such schemes are commonly used to implement the communication bus in zkVMs. Broadly speaking, these protocols can be used to prove statements of the following form:

Given a table \(T = \{t_i\}, \quad i = 0, ..., N-1\) where all values are distinct (referred to as “rows”),
And a lookup list \(F = \{f_j\}, \quad j = 0, ..., m-1\) (where values may repeat),
All lookup values are contained within the table, i.e., \(F \subseteq T\).

In this framework, the table \(T\) is typically considered public, and the lookup list \(F\) is treated as private evidence. Table \(T\) can be understood as storing all valid values for a particular variable, while the lookup list \(F\) contains the specific instances of this variable generated during program execution. The proven statement asserts that throughout program execution, the variable always remains within its valid range.

In this discussion, we assume \(m < N\), and typically \(m \ll N\) (unless stated otherwise). We will review the evolution of lookup protocols and their various applications.

Before delving into lookup schemes, let’s first examine a permutation parameter using multiset equality:

\[\prod_{i}(X-f_{i})=\prod_{j}(X-t_{j})\]

where \(X\) belongs to the field \(F\). We can choose a random number \(\alpha \in F\) and simplify the check of the above polynomial equality into a grand product.

Moreover, if \(F \subseteq T\), there exists \(m_j\) such that:

\[\prod_{i}(x-f_{i})=\prod_{j}(x-t_{j})^{m_{j}}\]

In particular, if all \(m_j\) are 1, then we have a multiset equality problem. This may seem to solve the problem, but the computational complexity is related to the size of set \(T\), which could be very large.

Let us now dive deeper into the history of lookup schemes.

4.3 Lookup Argument Schemes

4.3.1 Plookup

Plookup is one of the earliest lookup protocols. The prover’s computational complexity is \(O(N \log N)\) field operations, and the protocol can be generalized to support multi-table and vector lookups. The process involves sorting the elements of vector f and table t in ascending order and defining:

\[\{(s_{k'},s_{k+1})\}=\{(t_{i},t_{j+1})\}\cup\{(f_{i},f_{i+1})\}\]

as multisets, and then performing the following check:

\[\prod_{k}(X+s_{k}+Y\cdot s_{k+1})=\prod_{i}(X+f_{i}+Y\cdot f_{i+1})\prod_{j}(X+t_{j}+Y\cdot t_{j+1})\]

where \(X\) and \(Y\) belong to the field \(F\).

This protocol can be simplified further using a grand product.

4.3.2 Caulk

Caulk makes the prover’s workload dependent on the size \(m\) of \(C_{M}\) rather than the size \(N\) of \(C\). The prover identifies a subset \(C_{i}\) (which contains elements of \(C_{M}\)) and uses a KZG commitment to prove that \(C - C_i = Z_i(x) \cdot H_i(x)\). This protocol has sublinear efficiency, with a prover complexity of \(O(m \log N)\).

4.3.3 Caulk+

Caulk+ is an improved version of Caulk that reduces the prover’s computational complexity further by more efficient divisibility checks. The protocol proves that \(Z_i\) can divide \(C - C_i\) and \(x^n - 1\) by computing the polynomials \(Z_{I}\), \(C_{i}\), and \(U\). With the introduction of blinding factors to preserve zero-knowledge properties, the prover’s complexity is reduced to \(O(m^2)\).

4.3.4 Baloo

Baloo extends Caulk+ by committing to subsets of the table in the form of vanishing polynomials, achieving nearly linear time complexity for the subset size. It introduces a “commit and prove” protocol and uses the general Sumcheck protocol to reduce the proof to an inner product argument. This protocol is efficient and supports multi-column lookups, showing wide applicability in SNARKs such as zkEVM.

4.3.5 Flookup

Flookup is designed for efficiently proving that the value of a committed polynomial belongs to a large table. The protocol leverages pairings to extract vanishing polynomials for related table subsets and introduces a new interactive proof system (IOP). After \(O(N \log^2 N)\) preprocessing, the prover operates in nearly linear time \(O(m \log^2 m)\), offering substantial improvements over previous quadratic complexities, particularly for SNARKs over large fields.

4.3.6 cq

cq uses logarithmic derivative methods to simplify membership proofs to rational polynomial equality checks. By precomputing cached quotients for quotient polynomials, the protocol makes table item computations more efficient. The prover time is \(O(N \log N)\), and the proof size is \(O(N)\), offering better efficiency than Baloo and Flookup while maintaining homomorphic properties.

4.3.7 LogUp

LogUp efficiently proves that a set of witness values exists within a Boolean hypercube lookup table. By using logarithmic derivatives, this method transforms the set inclusion problem into rational function equality checks, requiring the prover to provide only a multiplicity function. LogUp is more efficient than multi-variable Plookup variants, requiring 3-4 times fewer Oracle commitments. For large-scale lookups, its efficiency also surpasses the bounded multiplicity optimization of Plookup. This method is well-suited for vector-valued lookups and scalable for range proofs, playing a crucial role in SNARKs (such as PLONK and Aurora) and applications like tinyRAM and zkEVM.

Further reading: Lookup Protocol Paper and Video Link

4.4 Proof Aggregation and Recursion

Aggregation is the simplest way to generate individual proofs for each block and combine them into another proof. The first type of proof verifies “the block is valid and on-chain”, while the second type verifies “all these block proofs are valid.” This method is known as aggregation.

In an aggregation scheme, the second circuit can only start aggregation after all block proofs are ready. However, what if we could process the blocks one by one? This is what recursion accomplishes.

In a recursive scheme, each block proof verifies two things: the current block is valid, and the previous proof is valid. One proof “wraps” the previous proof, with the overall structure resembling a chain.

4.5 Continuation Mechanism

Continuation is a mechanism for splitting a large program into smaller segments that can be executed and proven independently. This mechanism provides the following benefits:

Parallel proof generation
Enabling checkpoint functionality in zkVM
Limiting memory requirements to a fixed size, regardless of program size

Segment: A segment is a sequence of MIPS traces with entry PC and image ID.

Check Memory Consistency

4.6 zkVM Examples

4.7 zkVM Evaluation

4.7.1 Efficiency

To estimate the time required to generate a proof for program \(P\), we need to consider:

The number of instructions in program \(P\).
The complexity of constraints.

And use the following formula to calculate the ISA efficiency:

\[\text{#Instructions} \times \frac{\text{Constraint Complexity}}{\text{Per Instruction}} \times \frac{\text{Time}}{\text{Constraint Complexity}}\]

Source: Video Link

Method for Evaluating zkVM Performance:

Test scenarios: SHA2, SHA3, Rust EVM, Rust Tendermint, etc.

Metrics:

Proving cycles per second (Hz).
Proving energy cost per gas.

Open Source Frameworks:

Including RISCZero, zkMIPS, SP1, and Jolt, with benchmarks available at zkMIPS/zkvm-benchmarks.

4.7.2 Developer Toolchain

zkVM should provide a developer-friendly toolchain that allows developers to build, compile their programs, and deploy validators onto the blockchain.

Correctness

The VM must execute programs as expected.

The proof system must meet its stated security properties, such as:

Soundness
Completeness
Zero Knowledge

Conciseness

Proofs generated by the zkVM should be verifiable on-chain.
Smaller proof sizes, shorter verification times, and transparent setups are desirable features.

4.8 Application Scenarios

Hybrid Rollup: Optimistic + ZK
‍
Metis combines the flexibility and composability of Optimistic Rollups with the scalability of zkMIPS to form a unified protocol, reducing final confirmation time from 7 days to less than 1 hour.

Bitcoin L2:
‍
The GOAT network implements a BitVM2 protocol based on zkMIPS, building a truly secure Bitcoin L2.

zk Identity (zkIdentity)

zk Machine Learning (zkML)

Subscribe to the ZKM Blog to stay updated with the latest research by ZKM. If you have any questions about the article, you can contact the ZKM team on discord: discord.com/zkm

H1

h2

‍

h3

‍

H4

H5

What’s a Rich Text element?

The rich text element allows you to create and format headings, paragraphs, blockquotes, images, and video all in one place instead of having to add and format them individually. Just double-click and easily create content.

Static and dynamic content editing

A rich text element can be used with static or dynamic content. For static content, just drop it into any page and begin editing. For dynamic content, add a rich text field to any collection and then connect a rich text element to that field in the settings panel. Voila!

How to customize formatting for each rich text

Headings, paragraphs, blockquotes, figures, images, and figure captions can all be styled after a class is added to the rich text element using the "When inside of" nested selector system.

aasdasd

Experience in the fast-paced and dynamic work culture of an international team
Excellent growth opportunity as your role expands in direct relation to the growth of your region’s participation/involvement/adoption
Learn about the Metis ecosystem and its products

1. Metis Technical Advocate
2. Metis Community Advocate

// Get all <p> elements in the document
const paragraphs = document.querySelectorAll('p');

// Loop through each paragraph to check if it's preceded by an <h1> or <h2>
paragraphs.forEach(paragraph => {
    // Get the previous sibling element of the current paragraph
    const previousElement = paragraph.previousElementSibling;
    
    // Check if the previous element is an <h1> or <h2>
    if (previousElement && (previousElement.tagName === 'H1' || previousElement.tagName === 'H2')) {
        // Change the margin of the current paragraph
        paragraph.style.marginTop = '20px'; // Adjust the margin value as needed
    }
});

‍

Headings, paragraphs,

‍

Headings, paragraphs,

zkMIPS Beta: A Competitive Performance Report

In this work, we begin to publish a general and fair zkVM benchmark framework based on previous work by a16z, providing a comparison on proving time and energy cost between ZKM (zkMIPS) and other zkVM projects, like RISC Zero (R0) and SP1.

Cross-chain Asset Transfer Without a Bridge - Part One

To accomplish cross-chain asset transfer, most of the solutions currently available are based on a bridge, a separate, intermediate entity, which is typically trusted with holding these assets during some period of the transaction. This trust assumption is undesirable since it provides a large opportunity for attack. In this post I will explain that, assuming the existence of a zkRollup, one can implement cross-chain asset transfer without the need for additional trust assumptions (such as a bridge).

1. Prerequisite Knowledge

1.1 Concepts
1.2 Circuits and Gates

2. In-Depth Analysis of zkVM

2.1 Overview
2.2 Virtual Machine (VM)

2.2.1 Two Main Architectures
2.2.2 RISC Instruction Set Architecture (ISA)

2.3 Implementing a Simple zkVM: Fibonacci Executor

2.3.1 Generating the Execution Trace
2.3.2 Building Polynomial Equations
2.3.3 Introducing Cyclicity
2.3.4 Polynomial Commitment

3. General zkVM Design Concepts

4. General Techniques

4.1 Host and Submachine
4.2 Lookup Scheme
4.3 Lookup Argument Schemes

4.3.1 Plookup
4.3.2 Caulk
4.3.3 Caulk+
4.3.4 Baloo
4.3.5 Flookup
4.3.6 cq
4.3.7 LogUp

4.4 Proof Aggregation and Recursion
4.5 Continuation Mechanism
4.6 zkVM Examples
4.7 zkVM Evaluation

4.7.1 Efficiency
4.7.2 Developer Toolchain

4.8 Application Scenarios

zkVM (Zero-Knowledge Virtual Machine) is a virtual machine that utilizes Zero-Knowledge Proofs (ZKP) to ensure the correctness, integrity, and privacy of computations.

1. Prerequisite Knowledge

1.1 Concepts

Zero-Knowledge
Succinctness
Non-Interactivity
Transparency

Validity Proof:

Used to prove the validity of state transitions. For example, zk-Rollups use validity proofs to demonstrate the legality of state transitions to the parent chain, typically in combination with proof systems like SNARKs and STARKs.

Fraud Proof:

Used to challenge the validity of transactions by the verifier.

Verifiable Computation:

Allows clients to outsource computation tasks of a function F to untrusted workers and verify the correctness of the results returned.

1.2 Circuits and Gates

When writing a ZKP application, the following three steps are involved:

Define the problem (constraint satisfaction problem) using a language

Here is an example of how to implement a multiplier using Circom.

Multiplier2.circom:

pragma circom 2.0.0;
/* This circuit template checks if c is the product of a and b. */
template Multiplier2 () {
   // Signal declarations
   signal input a;
   signal input b;
   signal output c;
   // Constraints
   c <== a * b;
}

The flowchart is as follows:

Arithmetization: Converting a program into a set of polynomials

Circuit Computation:
This method reduces the program to the gate-level concept. However, the gates here are not physical gates in a processor, but logical concept gates.‍

R1CS: The maximum polynomial degree is 2, satisfying the following equation:
\[\left(\sum_{k}A_{ik}Z_{k}\right)\left(\sum_{k}B_{ik}Z_{k}\right)-\left(\sum_{k}C_{ik}Z_{k}\right)=0\]

Computational Computation:
This method is similar to how computers work, involving a set of registers and state transition functions that modify the register values.

AIR and PAIR: Suitable for uniform computations.

CCS (Customizable Constraint System): CCS (Customizable Constraint System) is a general constraint representation method that supports R1CS, Plonkish, and AIR.
Simple Comparison

Define the Prover Instance, Generate Proof and Verifier
‍
Interactive Oracle Proof (IOP): IOP is an interactive proof protocol where the verifier does not need to read the entire message from the prover but can instead access any necessary symbols through an oracle. This allows the verifier’s runtime to be shorter than the total length of the proof (i.e., the sum of all message lengths).
‍
Polynomial Commitment Scheme: A cryptographic protocol used to commit to a polynomial and later verify the evaluation at specific points, preventing the prover from sending the entire witness w to the verifier.
‍
The general interactive protocol flow is as follows: 1. The prover commits to one or more polynomials using the specified polynomial commitment scheme. 2. The verifier randomly selects one or more field elements, sends them as challenges to the prover, and asks the prover to provide the evaluation values of the committed polynomials at these randomly chosen points (called the opening values). 3. This “question-and-answer” interaction can be repeated according to the number of openings required by the verifier, ensuring the reliability of the proof. 4. The verifier uses the relevant polynomial constraints to test the validity of the prover’s opening values.

The Fiat-Shamir method can be used to convert the interactive protocol into a non-interactive one.

KZG Commitment Scheme:

FRI Commitment Scheme (e.g., eSTARK):

Commitment Phase

MTR is the root node of the Merkle tree. The verifier randomly generates a value and sends it to the prover, who then submits the MTR to the verifier as the opening value.

Proof Systems:‍

Groth16: e.g., rapidsnark, libsnark, arkworks
‍Plonk: e.g., Bellman, Halo2
‍EthStark

Publish Verifier on-chain:

Choose the appropriate elliptic curve on-chain: e.g., BN128, BLS12381

Question: Can we use high-level languages (such as Golang, Rust) to define problem models?

2. In-Depth Analysis of zkVM

2.1 Overview

2.2 Virtual Machine (VM)

Computer Architecture refers to the structure of a computer system composed of components like CPU, memory, ALU, etc.

2.2.1 Two Main Architectures

CPU
Memory
Registers
Bus
Input/Output Units

2.2.2 RISC Instruction Set Architecture (ISA)

RISC (Reduced Instruction Set Computer) is designed to simplify the instruction set used by computers when performing tasks.

Below is an example of the ADDI instruction in MIPS32, which allows for the selection of a source register, a target register, and includes a small constant.

In zkVM, there are three main RISC ISAs:

MIPS
RISC V
WASM

Among them, RISC V is similar to a simplified version of MIPS.

2.3 Implementing a Simple zkVM: Fibonacci Executor

2.3.1 Generating the Execution Trace

Based on existing knowledge, we can implement a simple zkVM, such as a Fibonacci executor, to calculate the n-th term of the Fibonacci sequence.

\[f(n) = f(n-1) + f(n-2) \quad f(0) = f(1) = 1\]

We choose AIR (Algebraic Intermediate Representation) as the algebraization method.

Define a state machine \(S\) containing the following two variables, and let \(i\) be the field size:

\[S = (A_i, B_i) \quad S' = (A_{i+1}, B_{i+1})\]

Now, the state machine S can represent the Fibonacci executor as follows:

\[A_{i+1} = B_i \quad B_{i+1} = A_i + B_i\]

Thus, for all \(i\) in the field, the execution trace table is as follows, where \(n = 6\):

2.3.2 Building Polynomial Equations

In the virtual machine, each state is represented by a register.

The polynomials representing two registers come from the polynomial set \(F_p[X]\), where the coefficients are elements from the prime field \(F_p\) and \(p = 2^{64} - 2^{32} + 1\)

Thus, the domain forms a subgroup:

\[H = \{\omega_0, \omega_1, \omega_2, ..., \omega_d = \omega_0\} \subset F_p^d\]

Define two polynomials \(P(X)\) and \(Q(X)\) to represent the trace columns A and B:

\[P(\omega^{i})=A_{i}Q(\omega^{i})=B_{i}\]

It is clear that:

\[P(\omega^{i}\cdot\omega)=P(\omega^{i+1})=A_{i+1}\]

\[Q(\omega^{i}\cdot\omega)=Q(\omega^{i+1})=B_{i+1}\]

Using Lagrange interpolation, we can compute the coefficient form of \(P\) and \(Q\).

Now, we can use polynomial constraints to limit the state transitions with two registers:

\[P(X \cdot \omega) = I_H Q(X)\]

\[Q(X \cdot \omega) = I_H P(X) + Q(X)\]

2.3.3 Introducing Cyclicity

Since \(H\) is a subgroup, we have:

\[P(\omega^{5}\cdot\omega)=I_{H}Q(\omega^{5})=13\]

\[P(\omega^{5}\cdot\omega)=P(\omega^{0})=1\]

Clearly, the constraint is broken in the last row.

The solution is to introduce another register LAST to mark the last row and assign it a vector value \([0, ..., 1]\). The trace table becomes:

\[P(X \cdot \omega) = I_H Q(X) \cdot (1 - \text{LAST}(X)) + \text{LAST}(X) \cdot P(\omega^0)\]

\[Q(X \cdot \omega) = I_H (P(X) + Q(X)) \cdot (1 - \text{LAST}(X)) + \text{LAST}(X)\]

In summary, there are two basic types of constraints in the state machine:

State Transition Constraints: For registers like \(P\) and \(Q\).
Boundary Constraints: For registers like LAST.

Below is an overview of the zkMIPS implementation: