Comprehensive proof, validation, and backtest testing for Agencio Predict
Validates mathematical accuracy and correctness of calculations. These are the foundation tests.
Tests all performance metrics against known values with hand-calculated expected results.
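A hand-calculated check of this kind can be sketched as follows. This is only an illustration: the function name `sharpe_ratio`, its parameters, and the choice of the sample standard deviation (n - 1) are assumptions, not the platform's actual implementation.

```python
import math

def sharpe_ratio(returns, risk_free=0.0, periods_per_year=252):
    """Annualized Sharpe ratio using the sample standard deviation (n - 1).

    Hypothetical helper for illustration; not the platform's own function.
    """
    excess = [r - risk_free for r in returns]
    n = len(excess)
    if n < 2:
        raise ValueError("need at least two returns")
    mean = sum(excess) / n
    variance = sum((x - mean) ** 2 for x in excess) / (n - 1)
    if variance == 0:
        raise ValueError("returns have zero variance")
    return mean / math.sqrt(variance) * math.sqrt(periods_per_year)

# Hand calculation for returns [0.01, 0.03, -0.01, 0.01]:
#   mean            = 0.04 / 4 = 0.01
#   squared devs    = 0 + 0.0004 + 0.0004 + 0 = 0.0008
#   sample variance = 0.0008 / 3
#   per-period Sharpe = 0.01 / sqrt(0.0008 / 3) = sqrt(0.375)
expected = math.sqrt(0.375)
actual = sharpe_ratio([0.01, 0.03, -0.01, 0.01], periods_per_year=1)
assert abs(actual - expected) < 1e-9
```

Setting `periods_per_year=1` removes the annualization factor, so the test compares directly against the per-period value worked out by hand.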
Validates that all 500+ DSL primitives return the expected types and handle edge cases.
Tests the 12 quantitative analysis modules for institutional-grade accuracy.
A recent expert-level audit identified and fixed three mathematical issues:
Replaced simple average predictor with proper OLS AR coefficient estimation.
F = ((RSS_r - RSS_u) / k) / (RSS_u / (n - 2k - 1))
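The formula above has the shape of a nested-model F-test, where RSS_r and RSS_u are the residual sums of squares of the restricted and unrestricted regressions. A minimal sketch follows, assuming k is the number of restrictions (for example, lag terms) and n the number of observations. The function name and argument names are hypothetical.

```python
def nested_f_statistic(rss_restricted, rss_unrestricted, k, n):
    """F = ((RSS_r - RSS_u) / k) / (RSS_u / (n - 2k - 1)).

    Hypothetical helper; k and n are interpreted as the number of
    restrictions and the number of observations (an assumption).
    """
    df_denominator = n - 2 * k - 1
    if k <= 0 or df_denominator <= 0:
        raise ValueError("need k > 0 and n > 2k + 1")
    if rss_unrestricted <= 0:
        raise ValueError("unrestricted RSS must be positive")
    numerator = (rss_restricted - rss_unrestricted) / k
    denominator = rss_unrestricted / df_denominator
    return numerator / denominator

# Worked example: RSS_r = 120, RSS_u = 100, k = 2, n = 105
#   numerator   = (120 - 100) / 2 = 10
#   denominator = 100 / (105 - 4 - 1) = 1
f_stat = nested_f_statistic(120.0, 100.0, k=2, n=105)
assert f_stat == 10.0
```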
Implemented MacKinnon (1994, 2010) regression-based critical values.
c(p, n) = β_∞ + β₁/n + β₂/n²
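The response-surface formula above can be evaluated directly once the coefficients for a given significance level p are known. The sketch below uses placeholder coefficients chosen for easy arithmetic. They are not values from MacKinnon's tables, and the function name is hypothetical.

```python
def response_surface_critical_value(beta_inf, beta_1, beta_2, n):
    """c(p, n) = beta_inf + beta_1 / n + beta_2 / n**2.

    The three coefficients are assumed to be looked up for the chosen
    significance level p; this helper only evaluates the polynomial in 1/n.
    """
    if n <= 0:
        raise ValueError("sample size n must be positive")
    return beta_inf + beta_1 / n + beta_2 / n ** 2

# Placeholder coefficients (NOT real table values), n = 100:
#   -2.0 + (-4.0 / 100) + (-10.0 / 10000) = -2.041
value = response_surface_critical_value(-2.0, -4.0, -10.0, n=100)
assert abs(value - (-2.041)) < 1e-12
```

As n grows, the 1/n and 1/n² terms vanish and the critical value approaches the asymptotic coefficient β_∞.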
Full multivariate OLS to avoid omitted variable bias.
β = (X'X)⁻¹ X'Y
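As an illustration of the estimator above, the sketch below fits a multivariate regression in plain Python. Instead of forming the inverse explicitly, it solves the equivalent normal equations (X'X) β = X'Y by Gaussian elimination, which gives the same β when X'X is invertible. The function name and the sample data are hypothetical.

```python
def ols_coefficients(X, y):
    """Solve (X'X) beta = X'y by Gaussian elimination with partial pivoting.

    X is a list of rows (include a column of 1s for an intercept);
    y is a list of responses. Illustrative helper, not the platform's code.
    """
    n_rows, n_cols = len(X), len(X[0])
    # Build the augmented matrix [X'X | X'y].
    aug = []
    for i in range(n_cols):
        row = [sum(X[r][i] * X[r][j] for r in range(n_rows)) for j in range(n_cols)]
        row.append(sum(X[r][i] * y[r] for r in range(n_rows)))
        aug.append(row)
    # Forward elimination with partial pivoting.
    for col in range(n_cols):
        pivot = max(range(col, n_cols), key=lambda r: abs(aug[r][col]))
        if abs(aug[pivot][col]) < 1e-12:
            raise ValueError("X'X is singular; regressors are collinear")
        aug[col], aug[pivot] = aug[pivot], aug[col]
        for r in range(col + 1, n_cols):
            factor = aug[r][col] / aug[col][col]
            for c in range(col, n_cols + 1):
                aug[r][c] -= factor * aug[col][c]
    # Back substitution.
    beta = [0.0] * n_cols
    for i in range(n_cols - 1, -1, -1):
        tail = sum(aug[i][j] * beta[j] for j in range(i + 1, n_cols))
        beta[i] = (aug[i][n_cols] - tail) / aug[i][i]
    return beta

# Noise-free data generated from y = 1 + 2*x1 - 3*x2, so OLS should
# recover the coefficients [1, 2, -3] up to floating-point error.
X = [[1, 0, 0], [1, 1, 0], [1, 0, 1], [1, 1, 1], [1, 2, 1]]
y = [1, 3, -2, 0, 2]
beta = ols_coefficients(X, y)
assert all(abs(b - t) < 1e-9 for b, t in zip(beta, [1, 2, -3]))
```

Fitting all regressors jointly, rather than one at a time, is what guards against omitted variable bias: each coefficient is estimated while holding the other columns of X fixed.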
Historical backtesting against known market periods to validate trading behavior and identify anomalies.
Tests trading strategies against known historical periods and market regimes.
Scans for data quality issues and suspicious patterns that could affect trading.
End-to-end proof that platform features work as advertised. Demonstrates complete workflows.
Proves the complete algorithm workflow from natural language to paper trading.
Proves the complete stock research workflow from scan to alerts.
| Date | Suite | Status | Duration | Tests | Pass | Fail | Details |
|---|---|---|---|---|---|---|---|

Run the test suite to populate results. Results will be saved to scripts/testing/results/

| Category | Component | Tests | Coverage |
|---|---|---|---|
| Validation | Backtest Metrics (Sharpe, Sortino, VaR, etc.) | 15 | Complete |
| Validation | DSL Primitives (500+ primitives) | 16 categories | Complete |
| Validation | Quant Modules (12 modules) | 12 | Complete |
| Backtest | Historical Strategies | 5 strategies × 4 periods | Complete |
| Backtest | Anomaly Detection | 7 symbols × 12 checks | Complete |
| Proof | Algorithm Builder E2E | 9 | Complete |
| Proof | Stock Hunter E2E | 7 | Complete |
| Planned | Trading Signals | TBD | Planned |
| Planned | Risk Guardrails (L1-L4) | TBD | Planned |
| Planned | Fund Manager E2E | TBD | Planned |