Overview
This project automates threat detection instantiation or overhaul by first learning the unique data lake schema (all client environments are unique) and correlating that schema against a TTP framework (think MITRE). The schema discovery feeds a relational memory store that maps available telemetry to framework techniques. Once the schema is known we’re also able to audit existing detections and event sources to:
- Reveal any gap in coverage
- Identify potential detections given the available data set
- Curate detections for entire MITRE domains in bulk
Query Generation
Query generation uses Knowledge Graph-Augmented Generation (KGAG), a structured alternative to RAG that grounds every query against the discovered schema instead of relying on vector similarity and LLM inference. Benchmarked against 6 architectures including a 3B parameter LLM with full catalog context. The deterministic KGAG engine outperformed all of them.
Capabilities
The solution then is able to produce:
- NL-to-structured-query searching with .976 accuracy, a marked improvement from the vendor’s .36 accuracy
- Detection automation aligning with the chosen framework domain
- Detection automation given the available data sources
- Coverage assessment and gap analysis per framework tactic
- Knowledge object management (clean up my env!)
- Data modeling suggestions and implementation
Benchmark Comparison

This project’s Benchmarks:
