πŸ”¬ Dashboard β€” Experiment Status

Placeholder Data

Results marked TBD will be populated after PEC2 experiments complete. Replace each TBD with actual CodeCarbon + statistical output.


All Experiments

TABLE experiment-id AS "ID", title AS "Experiment", status AS "Status", methodology AS "Method", tool AS "Tool", date-started AS "Started", date-completed AS "Done"
FROM #experiment
SORT date-started DESC

By Status

TABLE WITHOUT ID status AS "Status", length(rows) AS "Count", rows.file.link AS "Experiments"
FROM #experiment
GROUP BY status

Pending Experiments

TABLE experiment-id AS "ID", title AS "Experiment", methodology AS "Method"
FROM #experiment
WHERE status = "planned"
SORT file.ctime ASC

Stage 1: Triage Results Placeholder

ModelStrategyF1-macroPrecisionRecallp-valuergCOβ‚‚eq
Heuristic baselineβ€”TBDTBDTBDβ€”β€”0
TF-IDF + SVMβ€”TBDTBDTBDβ€”β€”0
Llama 3.2 8BZero-ShotTBDTBDTBDTBDTBDTBD
Llama 3.2 8BFew-Shot-3TBDTBDTBDTBDTBDTBD
Llama 3.2 8BFew-Shot-6TBDTBDTBDTBDTBDTBD
Llama 3.2 8BCoTTBDTBDTBDTBDTBDTBD
Mistral 7BZero-ShotTBDTBDTBDTBDTBDTBD
Mistral 7BFew-Shot-3TBDTBDTBDTBDTBDTBD
Mistral 7BFew-Shot-6TBDTBDTBDTBDTBDTBD
Mistral 7BCoTTBDTBDTBDTBDTBDTBD

Update when PEC2 experiments complete. Replace TBD with actual values.


Stage 2: Estimation Results Placeholder

ModelStrategyMAE (SP)RMSEvs. Mean Hist.vs. Deep-SEvs. CoGEE
Mean historicalβ€”TBDTBDbaselineβ€”β€”
Deep-SEβ€”~3.2β€”β€”baselineβ€”
CoGEE (GPT-4)β€”~1.9β€”β€”β€”baseline
Llama 3.2 8BZero-ShotTBDTBDTBDTBDTBD
Llama 3.2 8BFew-Shot-3TBDTBDTBDTBDTBD
Mistral 7BCoTTBDTBDTBDTBDTBD

Carbon Summary

PhaseConditionsTotal gCOβ‚‚eqPer conditionPer issue
Stage 18 LLM + 2 baselineTBDTBDTBD
Stage 2TBDTBDTBDTBD

Fill with CodeCarbon output after experiments.