What are the main ensemble methods covered in this skill?

The skill covers voting classifiers, bagging, boosting, stacking, and dynamic ensemble selection. It demonstrates voting ensembles with scikit-learn's VotingClassifier, advanced stacking via an AdvancedStacker class that generates out-of-fold meta-features, and dynamic ensemble selection using k-nearest-neighbors to weight predictions based on local model competence.

How does the skill help ensure ensemble diversity?

The skill covers four types of diversity: algorithm diversity (combining different algorithm types like tree-based, linear, and neural), data diversity (using different subsets/features of training data), hyperparameter diversity (varying model configurations), and training diversity (different random seeds or bootstrap samples). It also provides diagnostics via a pairwise prediction-correlation matrix to validate that base-model correlation stays under 0.7.

What are the key pitfalls to avoid when building ensembles?

The skill warns against data leakage (never using test data when building the ensemble), overfitting (too many base models or overly complex meta-learners), redundant models (similar models that erode diversity benefits), and unbalanced weights (some models dominating the ensemble). It also emphasizes using proper cross-validation for stacking to prevent overfitting.

Does this skill cover training individual base models?

No, this skill assumes reasonable base models already exist and focuses only on how to combine them. It is not a guide to training individual base models, feature engineering, or hyperparameter tuning per algorithm.

Build Advanced Ensemble Machine Learning Models

Name: Ensemble Model Builder Agent
Availability: OnlineOnly
Author: VibeBaza

Skill for building voting, stacking, and dynamically-selected ensemble models with weight optimization and diversity diagnostics.

Works with: GitHub, sklearn

Updated 7 months ago

Version 1.0.0

Models

claude

Why it matters

Build ensemble machine learning models that combine diverse algorithms and data subsets, with the aim of improving predictive performance over any single model and balancing the bias-variance trade-off.

Outcomes

What it gets done

Design and implement voting classifiers and regressors.

Construct advanced stacking architectures with multi-level meta-learners.

Develop dynamic ensemble selection strategies based on model competence.

Optimize ensemble weights with constrained numerical optimization (scipy.optimize.minimize) for improved accuracy.

Install

Add it to your toolbox

Run in your project directory:

curl -fsSL https://spark.entire.vc/get/vb-ensemble-model-builder | bash

Overview

Ensemble Model Builder Agent

A skill for building ML ensembles - voting classifiers, stacking with out-of-fold meta-features, dynamic competence-based model selection, and ensemble-weight optimization, with diagnostics for model diversity via pairwise correlation. Use it when combining already-trained models into an ensemble, not for training or tuning the individual base models themselves.

What it does

This skill covers ML ensemble methods - combining multiple models for superior predictive performance - spanning voting classifiers, bagging, boosting, stacking, and advanced ensemble architectures, grounded in bias-variance tradeoffs and model-diversity principles.

Diversity requirements covered: algorithm diversity (combining fundamentally different algorithms - tree-based, linear, neural), data diversity (different subsets/features/representations of training data), hyperparameter diversity (varying model configs to capture different patterns), and training diversity (different random seeds, CV folds, or bootstrap samples).

Bias-variance guidance: pair high-bias models (linear) with low-bias models (trees) for a balanced ensemble; bagging reduces variance, boosting reduces bias; stacking learns optimal combination weights.
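The link between diversity and variance reduction can be made concrete with a standard textbook identity. It is an idealization and not part of the skill itself: it assumes M base predictors f_1, ..., f_M that each have the same variance σ² and the same pairwise correlation ρ.

```latex
\operatorname{Var}\!\left(\frac{1}{M}\sum_{i=1}^{M} f_i\right)
  = \rho\,\sigma^{2} + \frac{1-\rho}{M}\,\sigma^{2}
```

Under these assumptions, adding models shrinks only the second term. The first term is a floor set by the correlation between models, which is why lowering that correlation matters more than adding further near-identical models.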

Voting ensembles are demonstrated with scikit-learn's VotingClassifier/VotingRegressor, combining diverse base models (logistic regression, random forest, SVM with probability=True) into a soft-voting ensemble with per-model weights, evaluated against individual models via cross_val_score.

Advanced stacking is covered through an AdvancedStacker class that generates out-of-fold meta-features via k-fold cross-validation (cloning and refitting each base model per fold to avoid leakage), then trains a meta-model on those meta-features. Its base models are configured as follows:

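from sklearn.ensemble import RandomForestClassifier
from sklearn.linear_model import LogisticRegression, Ridge
from xgboost import XGBClassifier

# Diverse base models: bagged trees, boosted trees, and a regularized linear model
base_models = [
    ('rf', RandomForestClassifier(n_estimators=100, max_depth=5)),
    ('xgb', XGBClassifier(n_estimators=100, max_depth=3)),
    ('lr', LogisticRegression(C=0.1))
]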
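The out-of-fold procedure described above can be sketched roughly as follows. This is a minimal illustration, not the skill's AdvancedStacker: the function name out_of_fold_meta_features, the synthetic data, the binary-classification setting, and the choice of LogisticRegression as meta-model are all assumptions made for the example, and xgboost is left out so the sketch depends only on scikit-learn.

```python
import numpy as np
from sklearn.base import clone
from sklearn.datasets import make_classification
from sklearn.ensemble import RandomForestClassifier
from sklearn.linear_model import LogisticRegression
from sklearn.model_selection import KFold


def out_of_fold_meta_features(base_models, X, y, n_splits=5, seed=0):
    """Hypothetical helper: (n_samples, n_models) matrix of out-of-fold P(class 1)."""
    meta = np.zeros((X.shape[0], len(base_models)))
    folds = KFold(n_splits=n_splits, shuffle=True, random_state=seed)
    for train_idx, holdout_idx in folds.split(X):
        for col, (_, model) in enumerate(base_models):
            # Clone so every fold starts from a fresh, unfitted copy.
            fitted = clone(model).fit(X[train_idx], y[train_idx])
            # Each row is scored by a model that never saw it during fitting.
            meta[holdout_idx, col] = fitted.predict_proba(X[holdout_idx])[:, 1]
    return meta


# Synthetic binary data, used only to make the sketch runnable.
X, y = make_classification(n_samples=300, n_features=10, random_state=0)
base_models = [
    ("rf", RandomForestClassifier(n_estimators=50, max_depth=5, random_state=0)),
    ("lr", LogisticRegression(C=0.1, max_iter=1000)),
]
meta_features = out_of_fold_meta_features(base_models, X, y)

# The meta-model is trained only on out-of-fold predictions.
meta_model = LogisticRegression().fit(meta_features, y)
```

To predict on new data, the base models would additionally need to be refit on the full training set so that their outputs can be fed to meta_model.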

Dynamic ensemble selection is covered via a DynamicEnsemble class that uses k-nearest-neighbors to estimate each base model's local competence (accuracy in the neighborhood of a query point) and weights predictions accordingly at inference time.
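One way the local-competence idea could be implemented is sketched below. The class name LocalCompetenceEnsemble, its methods, the value k=7, and the synthetic data are illustrative assumptions, not the skill's DynamicEnsemble. The sketch also assumes that every base model was fitted on the same label set, so their classes_ arrays line up.

```python
import numpy as np
from sklearn.datasets import make_classification
from sklearn.ensemble import RandomForestClassifier
from sklearn.linear_model import LogisticRegression
from sklearn.model_selection import train_test_split
from sklearn.neighbors import NearestNeighbors


class LocalCompetenceEnsemble:
    """Hypothetical helper: weights fitted models by accuracy near each query."""

    def __init__(self, models, k=7):
        self.models = models
        self.k = k

    def fit_competence(self, X_val, y_val):
        # 1.0 where a model classified a validation point correctly, else 0.0.
        self.correct_ = np.column_stack(
            [(m.predict(X_val) == y_val).astype(float) for m in self.models]
        )
        self.nn_ = NearestNeighbors(n_neighbors=self.k).fit(X_val)
        return self

    def predict_proba(self, X):
        _, idx = self.nn_.kneighbors(X)                # (n_query, k)
        competence = self.correct_[idx].mean(axis=1)   # (n_query, n_models)
        competence = competence + 1e-9                 # guard against all-zero rows
        weights = competence / competence.sum(axis=1, keepdims=True)
        # Stack per-model probabilities: (n_query, n_models, n_classes).
        probas = np.stack([m.predict_proba(X) for m in self.models], axis=1)
        return (weights[:, :, None] * probas).sum(axis=1)

    def predict(self, X):
        # Map the arg-max column back to the original class labels.
        return self.models[0].classes_[self.predict_proba(X).argmax(axis=1)]


# Synthetic data split into train / validation / test, for illustration only.
X, y = make_classification(n_samples=400, n_features=10, random_state=0)
X_train, X_rest, y_train, y_rest = train_test_split(X, y, test_size=0.5, random_state=0)
X_val, X_test, y_val, y_test = train_test_split(X_rest, y_rest, test_size=0.5, random_state=0)

models = [
    RandomForestClassifier(n_estimators=50, random_state=0).fit(X_train, y_train),
    LogisticRegression(max_iter=1000).fit(X_train, y_train),
]
des = LocalCompetenceEnsemble(models, k=7).fit_competence(X_val, y_val)
proba = des.predict_proba(X_test)
labels = des.predict(X_test)
```

Competence is estimated on held-out validation data rather than the training set, since accuracy on points a model was fitted to would overstate how well it does in that neighborhood.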

Ensemble-weight optimization is shown via optimize_ensemble_weights, which uses scipy.optimize.minimize to find weights that minimize validation log-loss (or maximize accuracy), constrained to sum to 1 and stay non-negative. Diagnostics are covered via ensemble_diagnostics, which computes AUC/log-loss/accuracy per base model plus a pairwise prediction-correlation matrix - lower average correlation signals more diversity.
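A constrained weight search of the kind described above might look like the following sketch. It is not the skill's optimize_ensemble_weights: the function name fit_blend_weights, the SLSQP method, the binary setting, and the synthetic probabilities are assumptions chosen for illustration.

```python
import numpy as np
from scipy.optimize import minimize
from sklearn.metrics import log_loss


def fit_blend_weights(val_probas, y_val):
    """Hypothetical helper. val_probas: (n_models, n_samples) array of P(class 1)."""
    n_models = val_probas.shape[0]

    def objective(w):
        # Weighted average of the models' probabilities, kept away from 0 and 1.
        blended = np.clip(w @ val_probas, 1e-7, 1 - 1e-7)
        return log_loss(y_val, blended, labels=[0, 1])

    result = minimize(
        objective,
        x0=np.full(n_models, 1.0 / n_models),    # start from equal weights
        method="SLSQP",
        bounds=[(0.0, 1.0)] * n_models,          # non-negative weights
        constraints={"type": "eq", "fun": lambda w: np.sum(w) - 1.0},  # sum to 1
    )
    return result.x


# Synthetic validation predictions: one informative model, one pure-noise model.
rng = np.random.default_rng(0)
y_val = rng.integers(0, 2, size=500)
good = np.clip(0.5 + 0.35 * (2 * y_val - 1) + rng.normal(0, 0.1, 500), 0.01, 0.99)
noisy = rng.uniform(0.01, 0.99, size=500)

weights = fit_blend_weights(np.vstack([good, noisy]), y_val)
```

The weights should be fitted on validation predictions that the base models did not train on, otherwise the optimizer tends to reward whichever model overfits most.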

Best practices: start with voting ensembles before advanced stacking; validate diversity by keeping base-model correlation under 0.7; always use proper cross-validation for stacking to prevent overfitting; engineer different feature sets per base model; and balance model complexity against training-time budget.

Pitfalls to avoid: data leakage (never use test data when building the ensemble), overfitting (too many base models or an overly complex meta-learner), redundant models (many similar models erode the diversity benefit), and unbalanced weights (some models dominating the ensemble).

Production considerations: version base models and ensemble weights, consider parallel inference for independent base models, manage memory carefully for large ensembles, and A/B test the ensemble against the best individual model.
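As a starting point of the kind the first best practice recommends, a soft-voting baseline could be sketched as below. The synthetic dataset, the hyperparameters, and the weights [1, 2, 1] are placeholders chosen for illustration, not values taken from the skill.

```python
from sklearn.datasets import make_classification
from sklearn.ensemble import RandomForestClassifier, VotingClassifier
from sklearn.linear_model import LogisticRegression
from sklearn.model_selection import cross_val_score
from sklearn.svm import SVC

# Synthetic binary data, used only to make the sketch runnable.
X, y = make_classification(n_samples=300, n_features=10, random_state=0)

estimators = [
    ("lr", LogisticRegression(max_iter=1000)),
    ("rf", RandomForestClassifier(n_estimators=50, random_state=0)),
    # probability=True is required so the SVM can take part in soft voting.
    ("svm", SVC(probability=True, random_state=0)),
]
# Soft voting averages predicted probabilities, here with per-model weights.
voter = VotingClassifier(estimators=estimators, voting="soft", weights=[1, 2, 1])

# Mean 5-fold accuracy for each individual model and for the ensemble.
scores = {name: cross_val_score(model, X, y, cv=5).mean() for name, model in estimators}
scores["voting"] = cross_val_score(voter, X, y, cv=5).mean()
```

Comparing scores["voting"] against the individual entries shows whether the ensemble earns its extra cost before any stacking is attempted.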

When to use - and when NOT to

Use it when combining multiple trained models into a voting, stacking, or dynamically-selected ensemble to improve predictive performance beyond any single model. It is not a guide to training the individual base models themselves (feature engineering, hyperparameter tuning per algorithm) - it assumes reasonable base models exist and focuses on how to combine them.

Inputs and outputs

Given a set of trained base models and validation data, it returns ensemble code (voting, stacking, or dynamic selection), optimized combination weights, and diagnostics - per-model AUC/log-loss/accuracy plus a pairwise correlation matrix to check diversity.
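The diagnostics output described above could be produced along these lines. This is a rough sketch rather than the skill's ensemble_diagnostics: the function name diagnose, the 0.5 decision threshold, the binary setting, and the synthetic predictions are assumptions.

```python
import numpy as np
import pandas as pd
from sklearn.metrics import accuracy_score, log_loss, roc_auc_score


def diagnose(pred_probas, y_true, threshold=0.5):
    """Hypothetical helper. pred_probas: dict of model name -> 1-D array of P(class 1)."""
    rows = {}
    for name, p in pred_probas.items():
        rows[name] = {
            "auc": roc_auc_score(y_true, p),
            "log_loss": log_loss(y_true, p),
            "accuracy": accuracy_score(y_true, (p >= threshold).astype(int)),
        }
    metrics = pd.DataFrame(rows).T                 # one row per model
    corr = pd.DataFrame(pred_probas).corr()        # pairwise Pearson correlation
    # Average only the off-diagonal entries; the diagonal is always 1.
    off_diagonal = corr.values[~np.eye(len(corr), dtype=bool)]
    return metrics, corr, off_diagonal.mean()


# Synthetic predictions from two models that share the same underlying signal.
rng = np.random.default_rng(0)
y_true = rng.integers(0, 2, size=400)
signal = 0.5 + 0.3 * (2 * y_true - 1)
preds = {
    "model_a": np.clip(signal + rng.normal(0, 0.15, 400), 0.01, 0.99),
    "model_b": np.clip(signal + rng.normal(0, 0.15, 400), 0.01, 0.99),
}
metrics, corr, mean_corr = diagnose(preds, y_true)
```

A mean_corr above the 0.7 guideline mentioned elsewhere in this listing would suggest the base models are too similar to add much as an ensemble.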

Integrations

Code samples run on scikit-learn (VotingClassifier, RandomForestClassifier, LogisticRegression, SVC, KFold, NearestNeighbors), xgboost (XGBClassifier), scipy.optimize for weight optimization, and pandas/numpy for correlation diagnostics.

Who it's for

ML engineers and data scientists building ensemble models who need to combine several trained models for better predictive performance than any one model alone.
