Skip to content

Application Security · Databricks Reference MVP

Unify AppSec findings across CMDB, SCM, SAST, SCA, Secrets, DAST, and WAF.

An MVP implementation of a data integration framework for application security, built on Databricks.

The platform ingests from AppSec sources via a connector for each source, normalizes findings to recommended schemas, and exposes analytics over them. The reference implementation runs on Databricks and is packaged as an Asset Bundle.

Ingest: every AppSec source has a per-source connector that lands raw data in a Bronze schema and projects normalized findings + entities into a canonical Silver layer.

flowchart LR
  subgraph Sources["Sources"]
    direction TB
    GH[GitHub]
    GL[GitLab]
    SN[ServiceNow CMDB]
    SQ[SonarQube]
    SG[Semgrep]
    DT[Dependency-Track]
    TH[TruffleHog]
    ZAP[OWASP ZAP]
    WAF[AWS WAF]
  end

  subgraph Bronze["Bronze (raw)"]
    direction TB
    BG[bronze_github]
    BGL[bronze_gitlab]
    BSN[bronze_servicenow]
    BSQ[bronze_sonarqube]
    BSG[bronze_semgrep]
    BDT[bronze_dependency_track]
    BTH[bronze_trufflehog]
    BZAP[bronze_owasp_zap]
    BWAF[bronze_aws_waf]
  end

  subgraph Silver["Silver (canonical entities)"]
    direction TB
    SR["silver.repositories"]
    SA["silver.applications"]
    SAR["silver.app_repo_mapping"]
    SF["silver.findings"]
    SS["silver.suppression_rules"]
  end

  GH --> BG
  GL --> BGL
  SN --> BSN
  SQ --> BSQ
  SG --> BSG
  DT --> BDT
  TH --> BTH
  ZAP --> BZAP
  WAF --> BWAF

  BG --> SR
  BGL --> SR
  BSN --> SA
  BSQ --> SF
  BSG --> SF
  BDT --> SF
  BTH --> SF
  BZAP --> SF
  BWAF --> SF

  SR --> SAR
  SA --> SAR

  SR --> SF
  SAR --> SF

Analytics & serving: five Gold tables aggregate Silver into daily snapshots; two Online Tables (~5 min lag) serve those plus an open-findings view to a Databricks App for sub-50 ms point lookups, while all Gold tables also feed dashboards.

flowchart LR
  subgraph Silver2["Silver (inputs)"]
    direction TB
    SF["silver.findings"]
    SR["silver.repositories"]
    SAR["silver.app_repo_mapping"]
    SS["silver.suppression_rules"]
  end

  subgraph Gold["Gold (5 OLAP Delta tables refreshed daily + 1 view)"]
    direction TB
    GR1["gold.app_risk_posture_daily"]
    GR2["gold.mttr_by_source_severity_weekly"]
    GR3["gold.coverage_matrix"]
    GR4["gold.dedup_link_overlap"]
    GR5["gold.cwe_owasp_heatmap"]
    GVIEW["gold.app_repo_findings_open
(view)"]
  end

  subgraph OLTP["OLTP serving (Online Tables, ~5 min lag)"]
    direction TB
    OAR["gold_online.app_risk_posture"]
    OARF["silver_online.app_repo_findings"]
  end

  subgraph Consumers["Consumers"]
    direction TB
    APP["Databricks App
(security-score endpoint)"]
    DASH["Dashboards & SQL"]
  end

  SF --> GR1
  SF --> GR2
  SF --> GR3
  SF --> GR4
  SF --> GR5
  SAR --> GR1
  SAR --> GR5
  SR --> GR3
  SS --> GR1
  SS --> GR2
  SS --> GR4
  SS --> GR5

  SF --> GVIEW
  SAR --> GVIEW

  GR1 --> OAR
  GVIEW --> OARF

  OAR --> APP
  OARF --> APP
  GR1 --> DASH
  GR2 --> DASH
  GR3 --> DASH
  GR4 --> DASH
  GR5 --> DASH

Install order

Connector categories

  • 1. SCM


    GitHub, GitLab. Must be installed first; populates silver.repositories.

  • 2. CMDB


    ServiceNow.

  • 3. SAST


    SonarQube, Semgrep.

  • 4. SCA


    Dependency-Track.

  • 5. Secrets


    TruffleHog.

  • 6. DAST


    OWASP ZAP.

  • 7. WAF


    AWS WAF.