Cybersecurity Disclosure Agent
by Scott DeGeest
An agent that tracks cybersecurity incidents reported to the SEC. It helps supply chain professionals quickly check whether a company has disclosed a data breach or ransomware attack. It handles raw XML and PDF files from the SEC (including invalid XML), models subsidiary relationships (e.g. Change Healthcare ↔ UnitedHealth Group), and monitors token usage to track cost and context limits.

Project from the first cohort of the AI Engineering Buildcamp, by Scott DeGeest (Principal Data Scientist).
The system downloads raw files (often XML or PDF) from the SEC website. Scott built logic to handle both valid and invalid XML. The data is then converted and indexed in Elasticsearch. A key challenge was modeling subsidiaries so that the agent knows that, for example, "Change Healthcare" is related to "UnitedHealth Group." He also added a monitor for input and output tokens to keep an eye on costs and context limits.
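To make the subsidiary problem concrete, here is a minimal sketch of one way a lookup could expand a company name to its related entities before searching. This is not Scott's implementation: the `SUBSIDIARY_TO_PARENT` table, the `normalize` helper, and the `related_companies` function are all hypothetical names introduced for illustration, and a real system would likely load the relationships from a maintained data source rather than a hard-coded dictionary.

```python
import re

# Hypothetical lookup table (illustrative only): subsidiary -> parent company.
SUBSIDIARY_TO_PARENT = {
    "Change Healthcare": "UnitedHealth Group",
}

# Common corporate suffixes to ignore when comparing names (an assumption).
SUFFIXES = {"inc", "corp", "llc", "ltd", "co"}


def normalize(name: str) -> str:
    """Lowercase the name, drop punctuation and common corporate suffixes."""
    cleaned = re.sub(r"[^a-z0-9 ]", " ", name.lower())
    words = [w for w in cleaned.split() if w not in SUFFIXES]
    return " ".join(words)


def related_companies(name: str) -> list[str]:
    """Return the queried name followed by any known parent or subsidiaries."""
    key = normalize(name)
    related = [name]
    for subsidiary, parent in SUBSIDIARY_TO_PARENT.items():
        if normalize(subsidiary) == key:
            related.append(parent)      # queried a subsidiary: add its parent
        elif normalize(parent) == key:
            related.append(subsidiary)  # queried a parent: add the subsidiary
    return related


print(related_companies("Change Healthcare, Inc."))
```

The expanded list could then be used as the set of company names to search for, so a question about the subsidiary also surfaces disclosures filed under the parent.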
Tech stack: SEC data ingestion, XML/PDF parsing, Elasticsearch, token monitoring.
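The token monitoring mentioned above can be as simple as a running counter. The sketch below is an assumption about what such a monitor might look like, not the project's actual code: the `TokenMonitor` class, its field names, and the default limit and prices are hypothetical placeholders and should be replaced with the values for whichever model is in use.

```python
from dataclasses import dataclass


@dataclass
class TokenMonitor:
    # Placeholder values, not real model limits or prices.
    context_limit: int = 128_000
    usd_per_1k_input: float = 0.001
    usd_per_1k_output: float = 0.002
    input_tokens: int = 0
    output_tokens: int = 0

    def record(self, input_tokens: int, output_tokens: int) -> None:
        """Add the token counts reported for one model call to the totals."""
        self.input_tokens += input_tokens
        self.output_tokens += output_tokens

    @property
    def cost_usd(self) -> float:
        """Estimated spend so far, based on the per-1k-token prices."""
        return (
            self.input_tokens / 1000 * self.usd_per_1k_input
            + self.output_tokens / 1000 * self.usd_per_1k_output
        )

    def fits_context(self, prompt_tokens: int, reserve_for_output: int = 0) -> bool:
        """Check whether a prompt plus reserved output stays within the limit."""
        return prompt_tokens + reserve_for_output <= self.context_limit


monitor = TokenMonitor()
monitor.record(input_tokens=12_000, output_tokens=800)
print(monitor.input_tokens, monitor.output_tokens, monitor.fits_context(12_000, 4_000))
```

Calling `record` after every model response keeps a running total, and `fits_context` gives a cheap check before sending a long filing to the model.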