Application Security · Databricks Reference MVP
Unify AppSec findings across CMDB, SCM, SAST, SCA, Secrets, DAST, and WAF.¶
An MVP implementation of a data integration framework for application security, built on Databricks.
The platform ingests from AppSec sources via a connector for each source, normalizes findings to recommended schemas, and exposes analytics over them. The reference implementation runs on Databricks and is packaged as an Asset Bundle.
Ingest: every AppSec source has a per-source connector that lands raw data in a Bronze schema and projects normalized findings + entities into a canonical Silver layer.
flowchart LR
subgraph Sources["Sources"]
direction TB
GH[GitHub]
GL[GitLab]
SN[ServiceNow CMDB]
SQ[SonarQube]
SG[Semgrep]
DT[Dependency-Track]
TH[TruffleHog]
ZAP[OWASP ZAP]
WAF[AWS WAF]
end
subgraph Bronze["Bronze (raw)"]
direction TB
BG[bronze_github]
BGL[bronze_gitlab]
BSN[bronze_servicenow]
BSQ[bronze_sonarqube]
BSG[bronze_semgrep]
BDT[bronze_dependency_track]
BTH[bronze_trufflehog]
BZAP[bronze_owasp_zap]
BWAF[bronze_aws_waf]
end
subgraph Silver["Silver (canonical entities)"]
direction TB
SR["silver.repositories"]
SA["silver.applications"]
SAR["silver.app_repo_mapping"]
SF["silver.findings"]
SS["silver.suppression_rules"]
end
GH --> BG
GL --> BGL
SN --> BSN
SQ --> BSQ
SG --> BSG
DT --> BDT
TH --> BTH
ZAP --> BZAP
WAF --> BWAF
BG --> SR
BGL --> SR
BSN --> SA
BSQ --> SF
BSG --> SF
BDT --> SF
BTH --> SF
BZAP --> SF
BWAF --> SF
SR --> SAR
SA --> SAR
SR --> SF
SAR --> SF
Analytics & serving: five Gold tables aggregate Silver into daily snapshots; two Online Tables (~5 min lag) serve those plus an open-findings view to a Databricks App for sub-50 ms point lookups, while all Gold tables also feed dashboards.
flowchart LR
subgraph Silver2["Silver (inputs)"]
direction TB
SF["silver.findings"]
SR["silver.repositories"]
SAR["silver.app_repo_mapping"]
SS["silver.suppression_rules"]
end
subgraph Gold["Gold (5 OLAP Delta tables refreshed daily + 1 view)"]
direction TB
GR1["gold.app_risk_posture_daily"]
GR2["gold.mttr_by_source_severity_weekly"]
GR3["gold.coverage_matrix"]
GR4["gold.dedup_link_overlap"]
GR5["gold.cwe_owasp_heatmap"]
GVIEW["gold.app_repo_findings_open
(view)"]
end
subgraph OLTP["OLTP serving (Online Tables, ~5 min lag)"]
direction TB
OAR["gold_online.app_risk_posture"]
OARF["silver_online.app_repo_findings"]
end
subgraph Consumers["Consumers"]
direction TB
APP["Databricks App
(security-score endpoint)"]
DASH["Dashboards & SQL"]
end
SF --> GR1
SF --> GR2
SF --> GR3
SF --> GR4
SF --> GR5
SAR --> GR1
SAR --> GR5
SR --> GR3
SS --> GR1
SS --> GR2
SS --> GR4
SS --> GR5
SF --> GVIEW
SAR --> GVIEW
GR1 --> OAR
GVIEW --> OARF
OAR --> APP
OARF --> APP
GR1 --> DASH
GR2 --> DASH
GR3 --> DASH
GR4 --> DASH
GR5 --> DASH
Install order¶
-
Workspace bootstrap: catalog, schemas, jobs, secrets.
-
Wire each AppSec source.
-
Gold datasets, evidence scenarios, dashboards.