Overview
Summary
This article explains how beaconcha.in assigns human‑readable tags to Ethereum validators. Tags make it easier to understand which validators belong to well‑known entities (e.g., Lido, Rocket Pool), as well as large independent operators (“whales”). It covers data sources, the step‑by‑step tagging pipeline, how often tagging runs, and how conflicts are resolved. Shout‑out: A big thank‑you to Hildobby for maintaining a community dataset of validator/entity mappings used by many in the ecosystem.What is a validator tag?
A validator tag is a short, human-readable label associated with one or more validators. Tags help you:- Attribute validator performance to entities via their BeaconScore benchmark value.
- Understand decentralization by operator/entity.
- Identify large clusters of validators controlled by the same address.
CoinbaseorBinanceLidoorLido (SimpleDVT)(Lido modules)Rocket Pool(Rocket Pool validators)Whale_0x1234(an independent cluster labeled by address prefix)
High‑level pipeline
At a high level, the tagging module:- Imports community tags from the Hildobby dataset
- Queries on‑chain contracts to detect Lido validators (Curated, Simple DVT, Community Staking)
- Queries on‑chain contracts to detect Rocket Pool validators
- For any remaining untagged validators, clusters by withdrawal address and assigns whale tags if the cluster balance exceeds 320 ETH
- For any remaining untagged validators, clusters by deposit (funding) address and assigns whale tags if the cluster balance exceeds 320 ETH
Step‑by‑step details
1) Community tags via Hildobby- Source: Hildobby’s public validator/entity dataset (community maintained).
- Matching: The dataset contains mappings of validator indices or pubkeys to known entities. We import those tags directly.
- Why first? Community‑maintained mappings often include entities that are not easily inferred from on‑chain structures.
- Note: We retain Hildobby’s naming semantics and apply them as the baseline when no higher‑certainty on‑chain inference is available.
Lido Curated ModuleLido SimpleDVT ModuleLido Community Staking Module
- Enumerate the validator set exposed by each module’s contracts
- Map each validator pubkey to the appropriate tag, typically
Lido,Lido (SimpleDVT), orLido (CSM) - Prefer these determinations over community tags when a contract says a validator belongs to Lido
- Enumerate active and historical minipools
- Retrieve associated validator pubkeys
- Tag those validators as
RocketPool
- Cluster key: withdrawal address
- Cluster size metric: total effective balance of validators in the cluster
- Threshold:
> 320 ETH(equivalent to more than 10 full validators at 32 ETH each) - Tag format:
Whale_0x1234where0x1234is a short prefix of the withdrawal address for readability
- Cluster key: deposit/funding (from) address
- Cluster size metric: total effective balance of validators funded by that address
- Threshold:
> 320 ETH - Tag format:
Whale_0x1234
Tag priority and conflict resolution
When a validator qualifies for multiple tags, we follow this precedence (from highest to lowest certainty):- On‑chain entity detection (Lido modules, Rocket Pool)
- Community dataset (Hildobby)
- Inferred whale clustering by withdrawal address
- Inferred whale clustering by deposit address
Schedule and freshness
The tagging system runs on a fixed UTC schedule:- Daily at 10:00 UTC: Full tagging pipeline (all steps) runs in order, followed by a precompute job that updates the entity data dashboards.
- Hourly (other than 10:00): Precompute only. This ensures UI remains fast and consistent even between daily runs.
Examples
- A validator included in Lido’s Curated module contracts will be tagged
Lido. - An independent operator running 15 validators with the same withdrawal address will be tagged
Whale_0xABCDwhere0xABCDis a short prefix of their withdrawal address. - A cluster of 8 validators funded from address
0xF...will not get a whale tag because the balance threshold (> 320 ETH) is not met.
How to request a correction
If you believe a tag is incorrect:- Open an issue in the beaconcha.in support channel and include validator indices or pubkeys, plus the correct attribution if known.
- For on‑chain attributions (Lido/Rocket Pool), please provide the relevant contract references or transaction links.
Frequently asked questions
- Why does a whale tag use a short address prefix?
- For readability in the UI. The underlying full address is retained for accuracy.
- Will small operators get whale tags?
- No. The threshold is strict: only clusters with total balance
> 320 ETHqualify.
- No. The threshold is strict: only clusters with total balance
- When are tags assigned for new validators?
- Tags refresh daily with new data and may evolve if on‑chain ownership changes or the community dataset is updated.
- Does tagging affect rewards or protocol behavior?
- No. Tagging is a UI attribution feature only; it does not interact with consensus or execution layer incentives.

