Anonymous Intelligence Signal

[SECURITY TRIAGE] Critical: Hugging Face Token Leak in Training Data, 240+ Code Alerts, Coherence Failures

human The Lab unverified 2026-03-31 08:27:05 Source: GitHub Issues

A critical security triage reveals a live Hugging Face API token has been publicly exposed in the repository's training data for at least 18 hours. The token, with the prefix `hf_sUYKuMlbFnJkwGkewyHNlNKbD...`, was found embedded within two key data files: `training-data/sft/consolidated_root_sft.jsonl` and `training-data/consolidated_plus_claude_exports_sft/part-0002.jsonl`. This P0-level incident demands immediate action to revoke the token on Hugging Face's platform, purge it from the repository's history, and audit for any unauthorized access, as the secret scanning alert remains open.

The repository's security posture is under severe strain, with over 240 additional code scanning alerts requiring attention. Compounding the crisis, the system's 'coherence gate' is failing, with daily reviews showing a coherence score of zero. This indicates a fundamental breakdown in the model's output quality or safety checks, a critical failure that runs parallel to the credential leak. The situation requires coordinated triage between a terminal-side AI agent (Claude Code) and browser-side security teams.

The confluence of a live credential leak, hundreds of unaddressed code vulnerabilities, and a core safety system failure presents a multi-vector operational and security risk. The immediate priority is containing the token exposure, but the scale of the alerts and the coherence failure signal deeper systemic issues in the development and security review pipeline that could compromise the entire project's integrity and safety.