CIIE — Counterfactual CpG Inverse Impact Engine
Local SHAP-style attributions + in silico CpG “neutralization” to test how perturbations shift a model’s decision. From disease ? explanatory CpGs ? genomic context.
Why counterfactuals?
Predictions alone do not convince clinicians. CIIE augments a classifier with what-if reasoning: if we attenuate or suppress specific CpG signals, does the predicted label change?
This provides actionable evidence that a subset of CpGs is causally influential for the model, not just correlated. It also links those CpGs to regulatory elements and genes to suggest plausible mechanisms.
Inputs & outputs
- Input: trained model + feature set (CpGs) + a patient sample.
- Compute: local attributions (SHAP-like) + counterfactual trials.
- Output: ranked CpGs & minimal subsets that flip/shift the decision; genomic mapping & annotations.
How it works
Visual example
For a given patient, CIIE ranks CpGs by local impact. We progressively neutralize top-k CpGs and track the predicted class probability. If a small subset flips the decision, we show it along with the genomic context.
Demo dataset and full interactive plots will be available in the public beta.
Planned API & uploads
Users will be able to upload raw IDAT or cleaned beta matrices and custom signature lists (CpG sets). The pipeline validates, normalizes and runs CIIE against selected models.
- POST /upload - dataset (IDAT/CSV/Parquet) + metadata
- POST /run-ciie - choose model, k strategy, neutralization rule
- GET /result/{id} - attributions, counterfactual subsets, genomic mapping
Security & privacy
Runs execute on secured on-prem compute over VPN; the public site acts as a thin front-end. Logs and versions ensure reproducibility.