wenmin-wu
GitHub profile for wenmin-wu11 skills
Are you wenmin-wu? Claim your skills.
wenmin-wu / timeseries-tweedie-objective-zero-inflated
Utilizes LightGBM's tweedie objective for accurate zero-inflated count forecasting, enhancing predictions for retail demand.
wenmin-wu / timeseries-wavelet-denoising
Denoises 1D time series data using wavelet decomposition and thresholding, enhancing trend extraction without lagging the signal.
wenmin-wu / tabular-sparse-dense-hstack-lgbm
Trains LightGBM on combined sparse and dense data, preserving categorical features for effective model performance.
wenmin-wu / timeseries-rolling-refit-arima-forecast
Implements walk-forward validation for ARIMA forecasting, ensuring accurate error distribution by refitting on historical data at each step.
wenmin-wu / tabular-simulated-annealing-multi-operator
Implements advanced simulated annealing with multiple move operators for effective combinatorial optimization.
wenmin-wu / tabular-strtree-spatial-index-collision
Utilizes Shapely's STRtree for efficient polygon overlap detection, improving performance from O(n^2) to O(n log n).
wenmin-wu / tabular-tfidf-svd-dense-text-features
Transforms sparse TF-IDF text vectors into dense components for efficient use in gradient boosting decision trees.
wenmin-wu / timeseries-kaggle-api-streaming-inference
Enables day-by-day predictions using Kaggle's iter_test API while managing a rolling history buffer for lag feature computation.
wenmin-wu / timeseries-snap-event-interaction-features
Builds interaction features for retail forecasting by analyzing SNAP event impacts on sales and revenue across states.
wenmin-wu / tabular-personnel-count-parsing
Parses structured text fields into separate numeric columns, useful for analyzing sports rosters and inventory data.
wenmin-wu / tabular-phase-based-strategy-cycling
Implements a phase-based strategy for game AI, enabling adaptive behavior through cycling between aggressive and economic phases.
wenmin-wu / tabular-play-direction-normalization
Normalizes spatial coordinates in datasets to ensure consistent play direction, enhancing analysis in sports and robotics.
wenmin-wu / tabular-polynomial-interaction-features
Generates polynomial and interaction features from numeric data to enhance model performance by capturing nonlinear relationships.
wenmin-wu / tabular-popularity-fallback-recommendation
Enhances recommendation systems by filling gaps with popular items, improving user experience for cold-start scenarios.
wenmin-wu / tabular-ppo-gym-wrapper-kaggle-env
Wraps Kaggle competitive game environments as OpenAI Gym environments for training PPO agents using stable-baselines3.
wenmin-wu / tabular-predicted-class-mass-reweighting
This skill corrects class imbalance in predictions by rescaling ensemble probabilities, enhancing model calibration for better accuracy.
wenmin-wu / tabular-prior-rebalancing-oversampling
Rebalances training data by oversampling the majority class to align with test-set class prior, enhancing model calibration.
wenmin-wu / tabular-pseudo-labeling
Enhances model training by using high-confidence predictions as pseudo labels, improving AUC in semi-supervised learning for tabular data.
wenmin-wu / tabular-rank-averaging-ensemble
Ensembles predictions from multiple models by converting to ranks, averaging, and normalizing, enhancing prediction robustness.
wenmin-wu / tabular-rank-calibrated-blending
Blends predictions from multiple models using rank calibration to ensure accurate and monotonic probability outputs.