03 Building · Living resource

Functional Mapping of the Safety Tooling Landscape

A living map of the tools that keep platforms safer.

Existing frameworks for safety tooling lack the granularity required to map a rapidly diversifying ecosystem of internal and user-facing interventions. This workstream develops a comprehensive map, categorizing tools by functional utility, impact on user experience, and role in the safety lifecycle, from preventative design to reactive enforcement — and publishes it as a living, browsable resource.

A living map of the safety stack.

A growing catalogue of trust & safety tools — open-source and commercial, internal and user-facing. Six neighbourhoods on the map group the tools by what they do; switch to the card view to search or filter by topic.

The neighbourhood names are an evocative consolidation of the committee's preliminary topic buckets. A formal mapping by functional utility, user impact, and place in the safety lifecycle is being developed in parallel.

52 tools · 6 neighbourhoods

Click + drag to pan · scroll to zoom · click a pin to open the tool

Tooling Map illustration: a hand-drawn continent with six labelled neighbourhoods — Classifier Quarter, Guardrail Heights, Automation Alley, Reviewer's Row, Investigation Inlet, and The Hashlands.
Alice
Granite Guardian
Guardrails AI
Kanana Safeguard
Llama Guard
Llama Prompt Guard 2
Purple Llama
Risk Atlas Nexus
Roblox Guard 1.0
ShieldGemma
TrustedExecBench
Content Safety API
CoPE
Detoxify
gpt-oss-safeguard
Hive Classifiers
NSFW filtering
NSFW Keras Model
OSmod
Perspective API
Private Detector
Roblox Voice Safety Classifier
Safer by Thorn
Sentinel
Toxic Prompt RoBERTa
Implio by Besedo
Lasso Moderation
Osprey
Checkstep
Community Sift
Coop
ReTool
Altitude
Hasher Matcher Action (HMA)
Hasher-Matcher-Actioner (CLIP demo)
hma-matrix
Lattice Extract
MediaModeration (Wiki Extension)
PDQ
Perception
RocketChat CSAM
TMK
VPDQ
Adrift — awaiting a neighbourhood
Where the map is going

The proposed dimensions of the formal map.

As the committee firms up the map, each tool will be re-classified along three dimensions. The neighbourhoods on the map are a first cut; these are the more granular structures the committee plans to publish alongside the dataset.

Dimension i

Functional utility

What the tool actually does — classification, hash matching, review workflow, identity assurance, transparency reporting, and so on.

Dimension ii

Lifecycle position

Where in the safety lifecycle the tool acts — from preventative design, through detection, into responsive enforcement, and on to restorative measures.

Dimension iii

Impact on user experience

Whether the tool is internal (used by reviewers, engineers, or analysts) or user-facing (felt directly by people on the platform), and how visibly it shapes their experience.