Train GitHub Model
Ensemble: RF + XGBoost + HGB + LightGBM + SVM with 5-fold CV
What gets trained?
Five models (Random Forest, XGBoost, HGB, LightGBM, and SVM)
are combined into a soft-voting ensemble.
Feature importance and SHAP explanations are
auto-generated after training.
Required: the CSV must have a label column named one of:
label, is_fake, fake, target, is_bot, bot
(0 = Legit, 1 = Fake)
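The label-column requirement could be checked along these lines before training. This is a sketch under stated assumptions: `find_label_column` and `read_labels` are hypothetical helper names, matching is exact and case-sensitive, and when several accepted names are present the first one in the listed order wins. The tool's real behavior may differ on each of these points.

```python
import csv
import io

# Accepted label column names, in the order listed above.
LABEL_CANDIDATES = ("label", "is_fake", "fake", "target", "is_bot", "bot")


def find_label_column(fieldnames):
    """Return the first accepted label column present in fieldnames, else None."""
    present = set(fieldnames)
    for name in LABEL_CANDIDATES:
        if name in present:
            return name
    return None


def read_labels(csv_text):
    """Parse CSV text and return (label_column, list of 0/1 labels)."""
    reader = csv.DictReader(io.StringIO(csv_text))
    column = find_label_column(reader.fieldnames or [])
    if column is None:
        raise ValueError(
            "CSV needs one of these columns: " + ", ".join(LABEL_CANDIDATES)
        )
    labels = [int(row[column]) for row in reader]
    # Enforce the 0 = Legit, 1 = Fake encoding.
    unexpected = sorted(set(labels) - {0, 1})
    if unexpected:
        raise ValueError(f"labels must be 0 or 1, found {unexpected}")
    return column, labels


# Tiny made-up CSV for illustration only.
sample = "followers,repos,is_bot\n10,3,0\n0,1,1\n"
column, labels = read_labels(sample)
print(column, labels)
```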
Cross-validated metrics
Feature importance chart
Distribution analysis