VXDF: Validated EXploitable Data Flow

1 day ago 1

VXDF Dashboard

VXDF (Validated eXploitable Data Flow) is a next-generation security validation platform for verifying, validating, and reporting on security findings from any scanner.

Core Architecture
Key Components
Validation Workflow
Data Model
Project Structure
Installation
Running the Application
Dynamic Configuration
Documentation
License

VXDF follows a modular microservices architecture with clear separation between:

Validation Engine - Core vulnerability verification logic
API Layer - RESTful interface for frontend integration
Data Processing - SARIF/DAST/CycloneDX parsing pipeline to transform inputs into the canonical VXDF Pydantic models
Evidence Collection - Automated exploit validation system
Reporting - Serialization of the VXDFModel Pydantic object to the standard VXDF format for export

Validation Engine: Core business logic for vulnerability verification
Flask API: REST endpoints for frontend integration
SQLAlchemy ORM: Database management with SQLite
Parser System: Modular input processors (SARIF, DAST, CycloneDX) that normalize findings to the VXDF Pydantic models
Validator Plugins: Vulnerability-specific validation logic
Docker Integration: Isolated validation environments

React/TypeScript: Modern UI with Vite build system
Dynamic Dashboard: Real-time validation statistics
Data Flow Visualization: Interactive vulnerability tracing
Evidence Viewer: Validation proof inspection
Report Generator: Export of findings in the standard VXDF format (JSON)

Input Ingestion
- Accepts SARIF, DAST JSON, CycloneDX SBOMs
- Normalizes findings to the common VXDF data model (defined by Pydantic models in api/models/vxdf.py)
Vulnerability Processing
- Filters by severity/vulnerability type
- Enriches with CWE/CVSS data
Automated Validation
- Docker-based isolated testing
- Evidence collection (HTTP requests, stack traces)
- Exploitability confirmation
Reporting
- Generates VXDF-standard reports by serializing the populated VXDFModel Pydantic object
- Maintains audit trail of validation attempts

The canonical data model for VXDF is defined using Pydantic in Python, located in api/models/vxdf.py. This provides a structured and validated way to represent security findings according to the VXDF specification.

The root model is VXDFModel, which encapsulates the entire VXDF document. Key nested models include:

VulnerabilityDetailsModel: Describes a single vulnerability, including its ID, title, description, severity, affected components, exploit flows, and evidence.
EvidenceModel: Details a piece of evidence, specifying its type (e.g., HTTP request, code snippet, screenshot) and the associated structured data.
LocationModel: Specifies a precise location relevant to a vulnerability (e.g., source code location, web endpoint).
SeverityModel: Captures severity information, including qualitative levels and quantitative scores like CVSS.
ExploitFlowModel: Describes a sequence of steps an attacker might take.

These Pydantic models are the source of truth for the VXDF data structure and are used to generate the normative JSON schema.

For the complete normative JSON schema, refer to docs/normative-schema.json. The detailed specification document can be found in [docs/Validated Exploitable Data Flow (VXDF) Format.md](docs/Validated Exploitable Data Flow (VXDF) Format.md).

(Previously, this section showed an illustrative TypeScript interface, which is now superseded by the Pydantic models and the normative JSON schema.)

vxdf/ ├── api/ # Backend API and core functionality │ ├── core/ # Core validation engine │ ├── models/ # VXDF Pydantic models (api/models/vxdf.py defines the canonical schema) │ ├── parsers/ # Input format parsers │ └── validators/ # Vulnerability validators ├── config/ # Configuration files │ └── config.json # Main application configuration ├── data/ # Database and data files │ └── vxdf_validate.db # SQLite database ├── docs/ # Documentation, including the normative JSON schema (docs/normative-schema.json) and specification MD ├── frontend/ # React/TypeScript frontend │ ├── src/ # Frontend source code │ ├── package.json # NPM dependencies │ └── vite.config.ts # Frontend configuration ├── logs/ # Log files │ ├── backend.log # Backend log output │ └── frontend.log # Frontend log output ├── scripts/ # Application scripts │ ├── start-all.sh # Start both backend and frontend │ ├── stop-all.sh # Stop all services │ ├── start.sh # Start just the backend │ └── start-frontend.sh # Start just the frontend ├── test-data/ # Sample data for testing │ └── sample-sarif.json # Example SARIF file ├── tests/ # Test scripts and utilities ├── venv/ # Python virtual environment (created on first run) ├── LICENSE # License file ├── Makefile # Make targets for common operations ├── README.md # This file └── requirements.txt # Python dependencies (symlink to api/requirements.txt)

api/requirements.txt - Backend Python dependencies
api/models/vxdf.py - Canonical Pydantic models for the VXDF schema.
docs/normative-schema.json - The normative JSON schema generated from Pydantic models.
frontend/vite.config.ts - Frontend configuration, including API proxy settings
config/config.json - Application configuration
data/vxdf_validate.db - SQLite database (created on first run)

Backend API: http://localhost:6789
Frontend UI: http://localhost:3000
Log files: logs/backend.log and logs/frontend.log

Python 3.9+
Node.js 16+ and npm
Git

git clone https://github.com/your-username/vxdf.git cd vxdf pip install -r requirements.txt cd frontend npm install cd ..

▶️ Running the Application

The easiest way to start the application is to use the single script:

# Start both backend and frontend with one command ./scripts/start-all.sh # To stop all services ./scripts/stop-all.sh

The backend will be available at http://localhost:6789 and the frontend at http://localhost:3000.

You can also start the services separately if needed:

# Start just the backend API ./scripts/start.sh # In a separate terminal, start just the frontend ./scripts/start-frontend.sh

If you prefer to set up the application manually:

Create a virtual environment:

python3 -m venv venv source venv/bin/activate

Install dependencies:

pip install -r requirements.txt

Initialize the database (if it doesn't exist):

python3 api/load_sarif_to_db.py

Start the API server:

Navigate to the frontend directory:

Install dependencies:

Start the development server:

Sidebar navigation: Driven by frontend/src/config/sidebarConfig.ts (or API in future)
Dashboard data: All stats, charts, and tables are fetched from the backend API
Branding: Logo and product name are configurable
Alerts/Notifications: (Planned) Will be fetched from API

Import Order: Always import models in the correct order to avoid circular imports:
- First import database modules
- Then import model classes
Database Connection: Ensure the database file (data/vxdf_validate.db) is accessible and correctly initialized.
API Port: The API runs on port 6789 by default. Make sure this port is available.
Frontend Proxy: The frontend uses a proxy to communicate with the API. Check frontend/vite.config.ts if you change the API port.