RISE-NLP is an interactive auditing system for fairness diagnostics in probabilistic text classification. Instead of relying only on threshold-based scalar metrics (for example, Demographic Parity and Equalized Odds), RISE-NLP analyzes signed probability residuals (d = p_hat_plus - y) and compares subgroup residual distributions through quantile curves. This reveals where across the score range subgroup disparities concentrate, including central regions, transition bands, and high-confidence tails.
If the video does not play in your browser, watch or download it on Dropbox.
The interface links three layers of analysis in one workflow:

Three-stage RISE-NLP architecture: input source, residual computation and metrics, and interactive rendering for auditing.

Instance-level inspection view linked to selected residual-percentile regions. Examples may contain offensive language from hate-speech datasets.
Fdist directly summarizes distributional separation between subgroup residuals and is visually grounded as the shaded area between curves.Fdist remains small (~0.055-0.058), supporting diagnostic specificity.Fdist increases substantially (for example, ~0.137-0.150 on HateXplain), with visible subgroup separation.