🔬 Dashboard - Experiment Status
Placeholder Data
Results marked TBD will be filled in once the PEC2 experiments complete. Replace each TBD with the actual CodeCarbon and statistical output.
All Experiments
TABLE experiment-id AS "ID", title AS "Experiment", status AS "Status", methodology AS "Method", tool AS "Tool", date-started AS "Started", date-completed AS "Done"
FROM #experiment
SORT date-started DESC

By Status
TABLE WITHOUT ID status AS "Status", length(rows) AS "Count", rows.file.link AS "Experiments"
FROM #experiment
GROUP BY status

Pending Experiments
TABLE experiment-id AS "ID", title AS "Experiment", methodology AS "Method"
FROM #experiment
WHERE status = "planned"
SORT file.ctime ASC

Stage 1: Triage Results Placeholder
| Model | Strategy | F1-macro | Precision | Recall | p-value | r | gCO₂eq |
|---|---|---|---|---|---|---|---|
| Heuristic baseline | n/a | TBD | TBD | TBD | n/a | n/a | 0 |
| TF-IDF + SVM | n/a | TBD | TBD | TBD | n/a | n/a | 0 |
| Llama 3.2 8B | Zero-Shot | TBD | TBD | TBD | TBD | TBD | TBD |
| Llama 3.2 8B | Few-Shot-3 | TBD | TBD | TBD | TBD | TBD | TBD |
| Llama 3.2 8B | Few-Shot-6 | TBD | TBD | TBD | TBD | TBD | TBD |
| Llama 3.2 8B | CoT | TBD | TBD | TBD | TBD | TBD | TBD |
| Mistral 7B | Zero-Shot | TBD | TBD | TBD | TBD | TBD | TBD |
| Mistral 7B | Few-Shot-3 | TBD | TBD | TBD | TBD | TBD | TBD |
| Mistral 7B | Few-Shot-6 | TBD | TBD | TBD | TBD | TBD | TBD |
| Mistral 7B | CoT | TBD | TBD | TBD | TBD | TBD | TBD |
Update when PEC2 experiments complete. Replace TBD with actual values.
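When the predictions are available, the F1-macro, Precision and Recall columns could be filled with something like the sketch below. It assumes macro averaging means the unweighted mean of per-class scores (the scikit-learn convention), which this note does not state. The function name `macro_scores` and the labels in the usage note are hypothetical.

```python
from collections import Counter


def macro_scores(y_true, y_pred):
    """Return (precision, recall, f1), each macro-averaged over all classes.

    Assumption: macro-F1 is the unweighted mean of per-class F1 scores.
    A class with no predictions (or no true instances) scores 0.0.
    """
    labels = sorted(set(y_true) | set(y_pred))
    tp, fp, fn = Counter(), Counter(), Counter()
    for truth, pred in zip(y_true, y_pred):
        if truth == pred:
            tp[truth] += 1
        else:
            fp[pred] += 1   # predicted class gains a false positive
            fn[truth] += 1  # true class gains a false negative

    precisions, recalls, f1s = [], [], []
    for label in labels:
        predicted_total = tp[label] + fp[label]
        actual_total = tp[label] + fn[label]
        precision = tp[label] / predicted_total if predicted_total else 0.0
        recall = tp[label] / actual_total if actual_total else 0.0
        denom = precision + recall
        f1 = 2 * precision * recall / denom if denom else 0.0
        precisions.append(precision)
        recalls.append(recall)
        f1s.append(f1)

    n = len(labels)
    return sum(precisions) / n, sum(recalls) / n, sum(f1s) / n
```

For example, `macro_scores(["bug", "bug", "feature", "feature"], ["bug", "feature", "feature", "feature"])` uses made-up labels and returns one precision, recall and F1 triple for a single table row.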
Stage 2: Estimation Results Placeholder
| Model | Strategy | MAE (SP) | RMSE | vs. Mean Hist. | vs. Deep-SE | vs. CoGEE |
|---|---|---|---|---|---|---|
| Mean historical | n/a | TBD | TBD | baseline | n/a | n/a |
| Deep-SE | n/a | ~3.2 | n/a | n/a | baseline | n/a |
| CoGEE (GPT-4) | n/a | ~1.9 | n/a | n/a | n/a | baseline |
| Llama 3.2 8B | Zero-Shot | TBD | TBD | TBD | TBD | TBD |
| Llama 3.2 8B | Few-Shot-3 | TBD | TBD | TBD | TBD | TBD |
| Mistral 7B | CoT | TBD | TBD | TBD | TBD | TBD |
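The MAE and RMSE columns, and the mean-historical baseline row, could be computed along the lines of the sketch below. It assumes the baseline predicts the mean of past story points for every issue and that the "vs." columns hold plain MAE differences. Neither is specified in this note, and all function names are hypothetical.

```python
import math


def mae(actual, predicted):
    """Mean absolute error, in story points."""
    return sum(abs(a - p) for a, p in zip(actual, predicted)) / len(actual)


def rmse(actual, predicted):
    """Root mean squared error, in story points."""
    return math.sqrt(sum((a - p) ** 2 for a, p in zip(actual, predicted)) / len(actual))


def mean_historical_predictions(train_points, n_issues):
    """Assumed baseline: predict the historical mean for every test issue."""
    mean = sum(train_points) / len(train_points)
    return [mean] * n_issues


def mae_delta(model_mae, reference_mae):
    """Assumed content of a 'vs.' column: negative means the model beats the reference."""
    return model_mae - reference_mae
```

A row is then filled by calling `mae` and `rmse` on the model's predictions and passing the model MAE together with each reference MAE to `mae_delta`.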
Carbon Summary
| Phase | Conditions | Total gCO₂eq | Per condition | Per issue |
|---|---|---|---|---|
| Stage 1 | 8 LLM + 2 baseline | TBD | TBD | TBD |
| Stage 2 | TBD | TBD | TBD | TBD |
Fill with CodeCarbon output after experiments.
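One way to derive the three carbon columns from a CodeCarbon `emissions.csv` is sketched below. CodeCarbon reports the `emissions` column in kg CO₂eq, so the values are converted to grams. The sketch assumes each condition was tracked under its own `project_name` and that every condition processed the same number of issues. Both are assumptions about the experiment setup, and `summarize_emissions` is a hypothetical helper.

```python
import csv
from collections import defaultdict


def summarize_emissions(csv_lines, issues_per_condition):
    """Aggregate CodeCarbon rows into the Carbon Summary columns, in gCO2eq.

    csv_lines: an open emissions.csv file (or any iterable of CSV lines).
    Assumption: one project_name per experimental condition.
    """
    grams_by_condition = defaultdict(float)
    for row in csv.DictReader(csv_lines):
        # CodeCarbon's `emissions` column is in kg CO2eq; convert to grams.
        grams_by_condition[row["project_name"]] += float(row["emissions"]) * 1000.0

    total = sum(grams_by_condition.values())
    n_conditions = len(grams_by_condition)
    n_runs = n_conditions * issues_per_condition
    return {
        "total_g": total,
        "per_condition_g": total / n_conditions if n_conditions else 0.0,
        "per_issue_g": total / n_runs if n_runs else 0.0,
        "by_condition_g": dict(grams_by_condition),
    }
```

Typical use would be `with open("emissions.csv", newline="") as f: summary = summarize_emissions(f, issues_per_condition=...)`, with the issue count taken from the actual dataset.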