Files
mev-beta/PRODUCTION_DEPLOYMENT.md
Administrator 02e169c0e1 fix(v2): resolve critical integer overflow bug and add production deployment guide
This commit fixes a critical bug causing negative configuration values due to
integer overflow and adds comprehensive production deployment documentation.

## Critical Bug Fixed

**Issue**: Position size and loss limits showing negative values
**Root Cause**: Using big.NewInt(1e18) causes int64 overflow
- 1e18 = 1,000,000,000,000,000,000 (exceeds int64 max: 9,223,372,036,854,775,807)
- Results in wrap-around to negative values

**Affected Values:**
- MaxPositionSize: -8.4467 ETH → 10.0000 ETH ✓
- MaxDailyVolume: 7.7663 ETH → 100.0000 ETH ✓
- MaxHourlyLoss: (negative) → 0.1000 ETH ✓
- MaxDailyLoss: (negative) → 0.5000 ETH ✓

## Changes Made

### 1. Fix big.Int Construction (cmd/mev-bot-v2/main.go:439-455)

**Before (BROKEN):**
```go
MaxHourlyLoss: new(big.Int).Mul(big.NewInt(1), big.NewInt(1e17))  // OVERFLOW!
MaxPositionSize: new(big.Int).Mul(big.NewInt(10), big.NewInt(1e18))  // OVERFLOW!
```

**After (FIXED):**
```go
MaxHourlyLoss: new(big.Int).SetUint64(100000000000000000)  // 0.1 ETH (10^17 wei)
MaxDailyLoss: new(big.Int).SetUint64(500000000000000000)   // 0.5 ETH
MinProfit: new(big.Int).SetUint64(10000000000000000)       // 0.01 ETH
MinSwapAmount: new(big.Int).SetUint64(1000000000000000)    // 0.001 ETH
MaxPositionSize: new(big.Int).Mul(big.NewInt(10), new(big.Int).Exp(big.NewInt(10), big.NewInt(18), nil))
MaxDailyVolume: new(big.Int).Mul(big.NewInt(100), new(big.Int).Exp(big.NewInt(10), big.NewInt(18), nil))
```

### 2. Fix Display Function (cmd/mev-bot-v2/main.go:59-68)

**Before (BROKEN):**
```go
fmt.Sprintf("%.4f", float64(config.MaxPositionSize.Int64())/1e18)  // Int64() overflow!
```

**After (FIXED):**
```go
weiToEth := func(wei *big.Int) string {
    ethFloat := new(big.Float).SetInt(wei)
    ethFloat = ethFloat.Quo(ethFloat, big.NewFloat(1e18))
    result, _ := ethFloat.Float64()
    return fmt.Sprintf("%.4f", result)
}
```

### 3. Production Deployment Guide (PRODUCTION_DEPLOYMENT.md)

New comprehensive guide covering:
- **4-Phase Deployment Plan**:
  - Phase 1: Mainnet dry-run (48 hours)
  - Phase 2: Skipped (Anvil fork only, no testnet)
  - Phase 3: Minimal capital test (0.01 ETH)
  - Phase 4: Gradual scale-up (1-2 weeks)

- **Complete Configuration Examples**:
  - Conservative limits for each phase
  - Environment variable reference
  - Docker deployment commands

- **Emergency Procedures**:
  - 3 methods to stop bot immediately
  - Verification steps
  - Wallet balance checking

- **Monitoring Checklists**:
  - Every 4 hours: Status checks
  - Daily: Log analysis and P/L review
  - Weekly: Full health check and parameter tuning

- **Security Best Practices**:
  - Wallet security guidelines
  - RPC endpoint security
  - Container security hardening

- **Troubleshooting Guide**:
  - Common issues and solutions
  - Circuit breaker analysis
  - Performance debugging

## Test Results

All tests still passing with corrected values:
- **12/12 tests passing (100%)**
- Position size: 10.0000 ETH (correct)
- Daily volume: 100.0000 ETH (correct)
- Circuit breaker: 0.1 ETH hourly, 0.5 ETH daily (correct)

## Impact

**Before:** Bot would have incorrect risk limits, potentially:
- Blocking all trades (negative position size)
- Incorrect circuit breaker triggers
- Invalid loss tracking

**After:** All configuration values correct and production-ready

## Next Steps

1. Configure production wallet (PRIVATE_KEY)
2. Start Phase 1: 48h mainnet dry-run
3. Monitor and validate arbitrage detection
4. Proceed to Phase 3 if successful

🤖 Generated with [Claude Code](https://claude.com/claude-code)

Co-Authored-By: Claude <noreply@anthropic.com>
2025-11-11 01:35:08 +01:00

380 lines
9.6 KiB
Markdown

# MEV Bot V2 - Production Deployment Guide
**Last Updated:** 2025-11-11
**Status:** Ready for Mainnet Dry-Run Testing
**Test Results:** 12/12 Passing (100%)
---
## ⚠️ **Pre-Deployment Checklist**
### Critical Requirements
- [ ] Private key configured and secured
- [ ] Wallet funded (minimum 0.1 ETH for gas)
- [ ] RPC endpoints verified (mainnet Arbitrum)
- [ ] Emergency stop procedure documented and tested
- [ ] Monitoring alerts configured
- [ ] Team member available for 24h monitoring
### Safety Verification
- [x] All 12 safety tests passing
- [x] Emergency stop mechanism tested (8s detection)
- [x] Configuration values correct (no negative values)
- [x] Dry-run mode verified (0 transactions executed)
- [x] Circuit breaker configured (3 losses, 0.1 ETH hourly)
- [x] Position limits set (10 ETH max position, 100 ETH daily volume)
---
## 📋 **Deployment Phases**
### Phase 1: Mainnet Dry-Run (48 hours)
**Goal:** Validate arbitrage detection without executing trades
**Configuration:**
```bash
# .env settings for Phase 1
ENABLE_EXECUTION=false
DRY_RUN_MODE=true
ENABLE_SIMULATION=true
MIN_PROFIT_THRESHOLD=0.001 # 0.1%
```
**Success Criteria:**
- Bot runs stable for 48 hours
- Arbitrage opportunities detected
- No crashes or errors
- Profit calculations appear reasonable
**Monitoring:**
- Check logs every 4 hours
- Verify opportunities being detected
- Monitor resource usage (CPU, memory)
### Phase 2: Testnet Deployment (7 days)
**Status:** NOT READY - User requested Anvil fork only
**Note:** User specified deployments will be on Anvil fork of Arbitrum mainnet, not testnet.
### Phase 3: Minimal Capital Test (24-48 hours)
**Goal:** Execute real trades with minimal risk
**Configuration:**
```bash
# .env settings for Phase 3
ENABLE_EXECUTION=true
DRY_RUN_MODE=false
ENABLE_SIMULATION=true
# ULTRA-CONSERVATIVE LIMITS
MIN_PROFIT_THRESHOLD=0.01 # 1% minimum profit
MAX_POSITION_SIZE_ETH=0.01 # 0.01 ETH maximum
MAX_DAILY_VOLUME_ETH=0.1 # 0.1 ETH daily limit
MAX_CONSECUTIVE_LOSSES=1 # Stop after 1 loss
MAX_HOURLY_LOSS_ETH=0.01 # 0.01 ETH hourly loss limit
```
**Wallet Setup:**
- Use dedicated wallet (not primary funds)
- Fund with 0.1 ETH only
- Monitor balance every hour
**Success Criteria:**
- First trade executes successfully
- Profit/loss calculations accurate
- Circuit breaker triggers correctly if loss occurs
- Gas estimation within 10% of actual
### Phase 4: Gradual Scale-Up (1-2 weeks)
**Only proceed if Phase 3 succeeds**
**Week 1 Limits:**
```bash
MAX_POSITION_SIZE_ETH=0.1 # 0.1 ETH
MAX_DAILY_VOLUME_ETH=1.0 # 1.0 ETH
MAX_CONSECUTIVE_LOSSES=2 # 2 losses
```
**Week 2 Limits (if profitable):**
```bash
MAX_POSITION_SIZE_ETH=1.0 # 1 ETH
MAX_DAILY_VOLUME_ETH=10.0 # 10 ETH
MAX_CONSECUTIVE_LOSSES=3 # 3 losses
```
---
## 🔧 **Production Configuration**
### Environment Variables
**Required:**
```bash
# Wallet
PRIVATE_KEY=<your_private_key_hex> # 64-character hex (no 0x prefix)
# Arbitrum Mainnet RPC
ARBITRUM_RPC_ENDPOINT=wss://arbitrum-mainnet.core.chainstack.com/YOUR_KEY
ARBITRUM_WS_ENDPOINT=wss://arbitrum-mainnet.core.chainstack.com/YOUR_KEY
# Smart Contracts (Deployed)
CONTRACT_ARBITRAGE_EXECUTOR=0xec2a16d5f8ac850d08c4c7f67efd50051e7cfc0b
CONTRACT_FLASH_SWAPPER=0x5801ee5c2f6069e0f11cce7c0f27c2ef88e79a95
CONTRACT_UNISWAP_V2_FLASH_SWAPPER=0xc0b8c3e9a976ec67d182d7cb0283fb4496692593
# Arbiscan API
ARBISCAN_API_KEY=H8PEIY79385F4UKYU7MRV5IAT1BI1WYIVY
```
**Safety Settings:**
```bash
# Execution Control
ENABLE_EXECUTION=false # Start with false (dry-run)
DRY_RUN_MODE=true # Start with true
ENABLE_SIMULATION=true
ENABLE_FRONT_RUNNING=false
# Risk Limits (Conservative for Phase 1)
MIN_PROFIT_THRESHOLD=0.01 # 1% minimum
MAX_POSITION_SIZE_ETH=0.01 # 0.01 ETH
MAX_DAILY_VOLUME_ETH=0.1 # 0.1 ETH
MAX_SLIPPAGE_TOLERANCE=0.005 # 0.5%
# Circuit Breaker
MAX_CONSECUTIVE_LOSSES=1 # Stop after 1 loss
MAX_HOURLY_LOSS_ETH=0.01 # 0.01 ETH hourly
MAX_DAILY_LOSS_ETH=0.05 # 0.05 ETH daily
# Emergency Stop
EMERGENCY_STOP_FILE=/tmp/mev-bot-emergency-stop
```
---
## 🚀 **Deployment Commands**
### Build Production Image
```bash
podman build -t mev-bot-v2:production -f Dockerfile .
```
### Run with Production Config
```bash
podman run -d \
--name mev-bot-v2-prod \
--network host \
--restart unless-stopped \
--env-file .env \
-v $(pwd)/logs:/app/logs:z \
mev-bot-v2:production
```
### Monitor Logs
```bash
# Real-time logs
podman logs -f mev-bot-v2-prod
# Filter for opportunities
podman logs mev-bot-v2-prod 2>&1 | grep -i "opportunity"
# Filter for errors
podman logs mev-bot-v2-prod 2>&1 | grep -i "error"
# Check safety configuration
podman logs mev-bot-v2-prod 2>&1 | grep -A10 "SAFETY CONFIGURATION"
```
---
## 🚨 **Emergency Procedures**
### Immediate Stop
```bash
# Method 1: Emergency stop file (graceful, 10-second detection)
podman exec mev-bot-v2-prod touch /tmp/mev-bot-emergency-stop
# Method 2: Container stop (immediate)
podman stop mev-bot-v2-prod
# Method 3: Force kill (if unresponsive)
podman kill mev-bot-v2-prod
```
### Verification
```bash
# Check bot stopped
podman ps | grep mev-bot-v2-prod
# Check last logs
podman logs --tail 50 mev-bot-v2-prod
# Check wallet balance
cast balance <YOUR_WALLET_ADDRESS> --rpc-url https://arb1.arbitrum.io/rpc
```
---
## 📊 **Monitoring Checklist**
### Every 4 Hours
- [ ] Check bot is running: `podman ps`
- [ ] Review recent logs for errors
- [ ] Verify opportunities being detected
- [ ] Check wallet balance hasn't decreased unexpectedly
### Daily
- [ ] Review full logs: `podman logs mev-bot-v2-prod > daily_$(date +%Y%m%d).log`
- [ ] Analyze profit/loss
- [ ] Check gas usage vs profit
- [ ] Verify circuit breaker hasn't triggered
- [ ] Review arbitrage opportunity quality
### Weekly
- [ ] Full system health check
- [ ] Update RPC endpoints if needed
- [ ] Review and adjust risk parameters
- [ ] Backup logs and configuration
- [ ] Test emergency stop procedure
---
## ⚡ **Performance Metrics**
### Expected Behavior
- **Opportunities detected:** 1-10 per hour (highly variable)
- **Profitable opportunities:** 0.1-1% of all opportunities
- **Execution success rate:** >95% (when enabled)
- **Average profit per trade:** 0.01-0.1 ETH
- **Gas cost per trade:** 0.001-0.01 ETH
### Red Flags
- ⚠️ No opportunities detected for >6 hours
- ⚠️ Execution success rate <80%
- ⚠️ Average gas cost > average profit
- ⚠️ Circuit breaker triggering repeatedly
- ⚠️ Memory usage increasing over time
---
## 🔐 **Security Best Practices**
### Wallet Security
- Use dedicated wallet (not primary funds)
- Never expose private key in logs
- Store private key in secure vault
- Limit wallet balance to maximum daily volume + 10%
- Use hardware wallet for large deployments
### RPC Security
- Use private RPC endpoints (not public)
- Rotate API keys periodically
- Monitor RPC usage and rate limits
- Have backup RPC endpoints configured
### Container Security
- Run as non-root user (already configured)
- Limit container resources:
```bash
podman run --memory=2g --cpus=2 ...
```
- Mount logs as read-only from host
- Regular security updates
---
## 📝 **Troubleshooting**
### Bot Not Starting
```bash
# Check logs
podman logs mev-bot-v2-prod
# Common issues:
# 1. Missing PRIVATE_KEY → Add to .env
# 2. Invalid RPC endpoint → Test with: wscat -c wss://...
# 3. Port conflict → Check: netstat -tlnp | grep 9090
```
### No Opportunities Detected
```bash
# Verify RPC connection
cast block-number --rpc-url $ARBITRUM_RPC_ENDPOINT
# Check pool discovery
podman logs mev-bot-v2-prod 2>&1 | grep "pools_discovered"
# Lower profit threshold temporarily for testing
# Edit .env: MIN_PROFIT_THRESHOLD=0.001
```
### Circuit Breaker Triggered
```bash
# Check recent trades
podman logs mev-bot-v2-prod 2>&1 | grep -A5 "circuit breaker"
# Analyze why:
# 1. Real market losses → Normal, bot protecting you
# 2. Gas estimation errors → Adjust MAX_GAS_PRICE
# 3. Slippage issues → Increase MAX_SLIPPAGE_TOLERANCE slightly
# Reset circuit breaker by restarting:
podman restart mev-bot-v2-prod
```
---
## 📈 **Success Criteria**
### Phase 1 Success (Dry-Run)
- ✅ 48 hours of stable operation
- ✅ 10+ opportunities detected
- ✅ No crashes or errors
- ✅ Reasonable profit calculations
### Phase 3 Success (Live Trading)
- ✅ First trade profitable or break-even
- ✅ Circuit breaker working correctly
- ✅ Gas costs < 50% of gross profit
- ✅ No unexpected losses
### Long-term Success (1 month)
- ✅ Net positive profit after gas
- ✅ <5% of trades result in loss
- ✅ No emergency stops triggered
- ✅ Stable performance over time
---
## 🎯 **Current Status: Phase 1 Ready**
**What's Working:**
- ✅ All safety mechanisms tested (12/12 tests)
- ✅ Emergency stop verified (8-second detection)
- ✅ Configuration values correct
- ✅ Dry-run mode confirmed working
**Blockers to Production:**
- ⚠️ Need to configure production wallet (PRIVATE_KEY)
- ⚠️ Need to verify RPC endpoint connectivity
- ⚠️ Need 48h dry-run validation on mainnet
- ⚠️ Need to verify arbitrage opportunities exist
**Next Steps:**
1. Configure production wallet in .env
2. Start Phase 1 (48h dry-run on mainnet)
3. Monitor and validate opportunity detection
4. If successful, proceed to Phase 3 (minimal capital test)
---
## 📞 **Support & Resources**
- **Emergency Contact:** [Your contact info]
- **Logs Location:** `/docker/mev-beta/logs/`
- **Documentation:** `README.md`, `SAFETY_TEST_RESULTS.md`
- **Test Scripts:** `scripts/test_safety_mechanisms.sh`
---
**⚠️ WARNING: DO NOT deploy to mainnet with ENABLE_EXECUTION=true until Phase 1 dry-run is complete and validated.**