Breakdown: simple-blind-arbitrage
This guide is based off of simple-blind-arbitrage. Before you continue with this guide, we recommend reading the README for a technical overview of the system. If this raises more questions than it answers, that's OK! This guide will break down each component of the bot in detail.
We'll start by explaining the core concept of this bot's strategy: onchain searching.
Onchain Searching
Our goal is to turn a profit by backrunning pending transactions from MEV-Share. The "backrun transaction" is an arbitrage: we buy tokens on one exchange and sell them on another for a different price. Ideally, the difference in price will allow us to take a profit.
MEV-Share introduces some key differences from more common strategies that you may have seen elsewhere (e.g. simple-arbitrage, subway, rusty-sando). The main difference is that pending transactions typically expose less data, compared to transactions in the public mempool. Transaction signatures are always hidden from searchers. Simulation-based strategies (e.g. rusty-sando) on these transactions are usually not possible, since the amount traded by the user is typically hidden. That being said, users can choose to reveal more data to searchers, so all the classic strategies can still be used; they'll just land less often.
The strategy we'll use is called "onchain" searching: we calculate how much to trade within the "trade" itself, effectively executing the searching strategy & algorithm on the blockchain ("onchain"). We send a backrun for every transaction that touches the tokens we're interested in trading, and rely on Flashbots to prevent unprofitable trades from landing on-chain (formerly known as "being mined"). The amount we buy & sell in our arbitrage trades is derived from the prices of the assets on the blockchain at the time of execution. Because we place our transaction behind another user's transaction ("backrunning"), the price that our transaction sees is the price which has been changed by the user's trade. This is where we get our arbitrage opportunity.
Arbitrage Contract
We'll start by looking at a ready-made smart contract, and then break it down piece by piece.
This is the core logic contract, which contains functions for performing arbitrage between Uniswap-V2-like exchanges (e.g. UniV2 / Sushiswap). Later on, we'll create other contracts that inherit this one, so that we can add custom asset management logic (flash loans, where to store profits, etc.) without having to rewrite all the Uniswap-centric logic, which you likely won't need to change.
// Loading https://raw.githubusercontent.com/flashbots/simple-blind-arbitrage/main/src/BlindBackrunLogic.sol ...
This may look complicated, but by the end we'll have explained every line of code. We'll start at the top with Imports & Interfaces.
Imports & Interfaces
We start by importing some contract interfaces openzeppelin/access/Ownable.sol
and ./IWETH.sol
. Ownable allows us to restrict certain functions to the contract owner. IWETH allows us to deposit/withdraw ETH for WETH. We need WETH because Uniswap (V2/V3) only supports ERC20 tokens.
We also define a couple interfaces ourselves: IUniswapV2Pair
and IPairReserves
. We could import these from the official Uniswap contract library like we did with OpenZeppelin for the Ownable contract, but that comes with a lot of bloat for our project. In this case, we only need four functions from IUniswapV2Pair
, and the struct definition of PairReserves
from IPairReserves
.
Defining these interfaces allows us to interact with other smart contracts directly, as we'll see in the next sections.
Abstract Contract
It's important to remember that this is an abstract contract, meaning that to use it, we'll need to write another smart contract that extends it (using the is
keyword). That contract is responsible for writing the capital-management strategy; where to keep money, where/how to get it; as well as any other custom logic required for their specific strategy. We'll do a walkthrough of two different finished implementations after we break down the core logic contract. Read on to learn how our arbitrage algorithm works.
Arbitrage Algorithm
To illustrate what the algorithm does, we plot profit (in ETH) from an arbitrage, where we buy amount_in
tokens for WETH on one exchange and sell them all for WETH on another.
As you can see, the optimal amount_in to buy is approximately 35 ETH, but that's just eyeballing. How do we calculate the exact optimal point? To figure that out, first we need to figure out how to pre-calculate the outcome of a single swap. Then we'll compose two swaps together to calculate the outcome of an arbitrage. (This is how we made that chart!)
The amount of tokens we get out from one swap is defined with the following function:
- let =
FEE
= 997 - let =
FEE_DIVISOR
= 1000 - let =
reserveOut
- this refers to the reserves of the token that we're getting out of the trade
- let =
reserveIn
- this refers to the reserves of the token that we're paying into the trade
This is implemented in the getAmountOut
function in our smart contract. It's adapted from the UniswapV2 Library contract; we just removed the safety checks to save gas. We don't need guard rails since Flashbots will prevent reverting transactions from landing onchain.
function getAmountOut(
uint amountIn,
uint reserveIn,
uint reserveOut
) internal pure returns (uint amountOut) {
uint amountInWithFee = amountIn * 997;
uint numerator = amountInWithFee * reserveOut;
uint denominator = reserveIn * 1000 + amountInWithFee;
amountOut = numerator / denominator;
return amountOut;
}
If we calculate the from one pair, and set that as the input to another trading pair's function, then we get the ETH proceeds from an arbitrage. Subtract the original amount in () and that number is our gross profit.
The following function calculates the optimal trade amount such that gross profit is as high as possible:
- let =
FEE
= 997 - let =
FEE_DIVISOR
= 1000 - let =
reserveIn
for exchange A or B- this refers to the reserves of the token that we're paying into the trade
- let =
reserveOut
for exchange A or B- this refers to the reserves of the token that we're getting out of the trade
For example, if we're arbitraging WETH -> TKN on exchange A, then TKN -> WETH on exchange B, our variables would be:
- =
WETH.reserves
- =
TKN.reserves
- =
TKN.reserves
- =
WETH.reserves
How to derive this formula is beyond the scope of this document, but if you want to dig deeper, check out this paper.
This formula is implemented by the getAmountIn
function in our smart contract, which relies on getNumerator
and getDenominator
to do the math (and to avoid "stack too deep" errors).
Now that we know how to calculate the optimal amount of WETH to send for an arbitrage, let's put it to use.
_executeArbitrage
_executeArbitrage
is the core function responsible for looking up trading prices, calculating the optimal buy/sell amounts, and executing the two trades that make up the arbitrage. It only takes three arguments:
function _executeArbitrage(
address firstPairAddress,
address secondPairAddress,
uint percentageToPayToCoinbase
) ...
We just tell it which token pairs to trade, and how much profit to tip the validator.
The function starts by reading the smart contract's own WETH balance. This is used later to verify our profits. We use the pair addresses to instantiate uniswap Pair contracts, which we pass to getPairData
to read the reserves, which we then use to calculate the optimal arbitrage with getAmountIn(firstPairData, secondPairData)
.
Uniswap token pairs refer to their tokens as token0
and token1
; token0
being the one whose address is numerically less than the other (e.g. 0x0123 < 0x0234); so we need to discern which token of the pair's two tokens is WETH. Our getPairData
function sets this in the isWETHZero
field. If WETH is token0, then we'll trade token0 -> token1 on exchange A, then token1 -> token0 on exchange B. If WETH is token1, then we just switch "token0" with "token1" and apply the same formula.
Once we know which token is which, we calculate the amount of tokens we'll receive from each trade. We use these values as inputs to the token pairs' swap functions. See swap
on the Uniswap V2 Pair contract for more details on how swaps work.
Once we've executed our trades, we should expect to have more ETH (or WETH) than we started with. But that won't always be the case. To ensure that we don't pay for an unprofitable trade, we check the WETH balance at the end of the _executeArbitrage
function. If the balance isn't greater than when we first called the function, the transaction will revert. This protects us from malicious tokens, unforseen market conditions, and a variety of other ways you can lose your money.
When we do turn a profit, we need to pay some of it to the validators/builders in order to get our transactions on-chain. Block builders have differing preferences & ordering algorithms, but a good rule of thumb is to use a gas price higher than the market average, and unless you know you have no competition, send at least 80% of the profit to block.coinbase
.
Compile & Deploy (optional)
If you want to run a capital-intensive strategy (not using flash loans) you'll have to deploy your own contract. This is not required if you use our flash loan contract. The flash loan contract is designed to send profits to the caller when the arbitrage is done. This allows anyone to execute arbitrages without paying to deploy the contract. The tradeoff with this is that it costs more gas to transfer the profit to your wallet than it does to keep the money in the contract. However, if you decide to deploy your own contract, its transactions will use less gas (at the risk of your contract containing a bug that might compromise the funds), but will only land bundles if it holds enough capital to buy the required tokens.
A simple way to deploy contracts is to use forge
from Foundry:
# compile contracts
forge build
# to test with a local fork:
anvil -f $MAINNET_RPC_URL --chain-id 1 &
# these vars are set to deploy on local fork; change as/if needed
export PRIVATE_KEY="0xac0974bec39a17e36ba4a6b4d238ff944bacb478cbed5efcae784d7bf4f2ff80"
export WETH_ADDRESS="0xC02aaA39b223FE8D0A0e5C4F27eAD9083C756Cc2"
export RPC_URL="http://localhost:8545"
# deploy flashloan arb contract
forge create -r $RPC_URL --private-key $PRIVATE_KEY BlindBackrunFlashLoan --constructor-args $WETH_ADDRESS
# or deploy capital-intensive contract
forge create -r $RPC_URL --private-key $PRIVATE_KEY BlindBackrun --constructor-args $WETH_ADDRESS
output:
No files changed, compilation skipped
Deployer: 0xf39Fd6e51aad88F6F4ce6aB8827279cffFb92266
Deployed to: 0xc075BC0f734EFE6ceD866324fc2A9DBe1065CBB1
Transaction hash: 0xe9567cce60dfdc1f815f4724340228f2af77ee5cc157a69d07c4b270fcab3a30
Tradeoffs
Our onchain searching strategy is relatively straightforward, but it has its drawbacks:
- calculating trade amounts onchain costs gas, making the strategy less efficient
- only Uniswap-V2-like trading pairs are compatible with this strategy
- Uniswap V3 uses a different algorithm, which is very costly to compute onchain
Offchain Searching
The more data you can compute offchain, the less gas your transactions have to spend. If you rely on offchain computations to define your strategy, you can save a lot of gas, giving you a competitive edge.
However, some data can only be used in an onchain setting. For instance, if you were to run the same arbitrage calculation algorithm offchain that we have onchain in the BlindBackrunLogic contract, you'd probably get different results. This is because prices can change in the block; before your transaction is executed; but off-chain, we can't see that until the block has been finalized. The advantage of onchain searching is that you always have immediate access to the latest system state, so your calculations will always be accurate.
Other Exchanges
We strictly use Uniswap V2 because its pricing algorithm is simple, making arbitrages easily calculable. Uniswap V2 and Sushiswap use the same pricing algorithms, so we efficiently arb between those two exchanges. Uniswap V3 math is more complicated, making arbitrages on V3 very inefficient to calculate onchain.
However, Uniswap V3+ processes much more trade volume than V2. To improve your profits, consider developing a strategy that integrates Uniswap V3 into your own contract. It will likely involve probabilistic methods. Also note: Uniswap V4 uses the same pricing math as V3.
MEV-Share also supports Balancer and Curve.
Now that the core contract is ready, let's add flash loans. Read on in the next page.