Client Troubleshooting
See https://youtu.be/nY2SFjaDXHw?si=Sd-ko1lTmpgkz_VH&t=1560
Process running
Verify that the process is running (run on validator machine). In a new terminal window, shh into your server
ps aux | grep agave-validatorYou should see a line in the output that includes agave-validator with all the flags that were added to your validator.sh script.
Check the logs
Make sure your validator is producing reasonable log output (run on validator machine). In a new terminal window, ssh into your validator machine, switch users to the sol user and tail the logs:
su - sol
tail -f /home/sol/logs/agave-validator.logCheck version
Check the version you are starting with (run on validator machine). Useful if your validator is in some kind of restart loop
grep -B1 'Starting validator with' /home/sol/logs/agave-validator.log -A50Check PoH speed
grep -B1 'PoH speed check' /home/sol/logs/agave-validator.loghttps://discord.com/channels/428295358100013066/1187805174803210341/1346883323020050492 have you tried running on testnet, just to check? cpu may not be fast enough for poh if you grep for poh speed check in the log, post it here. you'll have to add the log to your startup cmd grep 'PoH speed check' log/validator.log
https://discord.com/channels/428295358100013066/1187805174803210341/1347136943825227776 Backup validator mainnet (EPYC 9254) [2025-03-05T11:42:00.837392249Z INFO solana_core::validator] PoH speed check: computed hashes per second 16740697, target hashes per second 10000000
Check Gossip
Make sure that the validator has registered itself with the gossip network (run anywhere)
Check voting readyness
Verify that your validator is ready to be a voting participant of the network (run anywhere).
After you have verified that your validator is in gossip, you should stake some SOL to your validator. Once the stake has activated (which happens at the start of the next epoch)
Check catchup speed
Check catchup speed with solana catchup (run on validator machine)
It tells you how far behind the network your validator is and how quickly you are catching up
If you use
--private-rpcthen you need to pass--our-localhosthere. See https://github.com/solana-labs/solana/issues/8407?ref=solana.ghost.io
Agave-Monitor
Monitor with agave-validator monitor (run on validator machine)
Check ports
Check that ports 8801, ... are open after agave-validator is running (run on validator machine)
It can take a few minutes for the process to open ports after the validator started
Check connection is possible from validator machine to the network entry points (run on validator machine)
Check reachability
Check connection is possible from the outside to your validator machine (run anywhere)
Check gossip entrypoint
Make sure network entry points resolve to the expected IP addresses (run anywhere). I've seen issues with outdated DNS server on Solana's side:
Check NAT
Check if you are behind a NAT (run on validator machine)
Get snapshot manually
Manually getting snapshot from another validator
Last updated
Was this helpful?