This is the curated local MacBook Air result set for the PyCon talk. Use this directory instead of older quick-validation outputs.
The MacBook Air tier is for correctness, debugging, reference checks, and lightweight evidence. It should not be presented as final large-scale speed evidence or compared directly with Linux server CPU/A100 results.
Core full-grid validation:
- kmeans_correctness.csv: full required k-means grid, 1,440 scenarios covered.
- permutation_equivalence.csv: full required permutation equivalence grid.
- permutation_calibration.csv: original full-grid null calibration rows.
- permutation_power.csv: original full-grid power rows.

Extra evidence added after visual review:
- kmeans_shape_stress.csv: targeted K=20/50, d=100, N=50,000, and selected N=100,000 k-means shape stress.
- permutation_calibration_extended.csv: 100 extra null calibration replicates.
- permutation_power_extended.csv: 168 rows over denser delta and signal-fraction settings.
- permutation_runtime_scaling_extended.csv: p/R runtime scaling with explicit memory-risk skips.
- extra_evidence_summary.json: row and status counts for the extra evidence.

Figure metadata:
- figure_manifest.csv: canonical list of primary figures and their source CSVs.
- figures/: deck-ready figures generated by experiments.visualization.plot_macbook_air_evidence.

- skipped_memory_risk rows for unsafe reference broadcast cases.
- skipped_memory_risk rows.
- latest CSV files.

- figures/kmeans_recovery_scenario_facets.png
- figures/kmeans_shape_stress_runtime.png
- figures/kmeans_runtime_recovery_tradeoff.png
- figures/kmeans_reference_equivalence.png
- figures/permutation_calibration_extended.png
- figures/permutation_power_extended.png
- figures/permutation_runtime_scaling_extended.png
- figures/permutation_equivalence_detail.png

Use the project default local environment:
python -m experiments.run_macbook_evidence_extra \
--output-dir experiments/results/macbook_air_long/latest \
--checkpoint-every 20 --max-iter 15
python -m experiments.visualization.plot_macbook_air_evidence \
--results-dir experiments/results/macbook_air_long/latest
The extra evidence runner appends rows and skips existing run_ids, so it can be resumed after interruption.
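The append-and-skip behavior described above can be illustrated with a minimal sketch. This is not the runner's actual code: the function names `load_done_ids` and `append_new_rows`, the `run_id` column name, and the `status` column in the usage note are assumptions made for illustration only.

```python
import csv
from pathlib import Path


def load_done_ids(csv_path, id_column="run_id"):
    """Return the set of run ids already present in the CSV (empty if no file)."""
    path = Path(csv_path)
    if not path.exists():
        return set()
    with path.open(newline="") as f:
        return {row[id_column] for row in csv.DictReader(f)}


def append_new_rows(csv_path, rows, fieldnames, id_column="run_id"):
    """Append only rows whose id is not yet in the file; return how many were written.

    Hypothetical helper: the column name and signature are assumptions,
    not the project's real interface.
    """
    path = Path(csv_path)
    done = load_done_ids(path, id_column)
    # Write the header only when starting a fresh (missing or empty) file.
    write_header = not path.exists() or path.stat().st_size == 0
    written = 0
    with path.open("a", newline="") as f:
        writer = csv.DictWriter(f, fieldnames=fieldnames)
        if write_header:
            writer.writeheader()
        for row in rows:
            if row[id_column] in done:
                continue  # already recorded by an earlier, possibly interrupted, run
            writer.writerow(row)
            done.add(row[id_column])
            written += 1
    return written
```

Calling `append_new_rows` twice with overlapping rows writes each run id once, which is the property that makes an interrupted run safe to restart with the same output directory.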