Find a suitable model for long-term efficiency gains.
Senior Design · Wabtec rail efficiency study
AI Drivetrain Efficiency
Machine learning for smarter locomotive motor power distribution.
Michael Penfield · Joel Sander · Connor Pflugh · Jacob Harsch
Introduction
Wabtec builds and maintains locomotives that run in demanding environments.
- Global rail equipment, systems, and digital solutions provider.
- High-performance locomotives operate around the clock.
- Sustainability gains matter when multiplied across fleets.
Needs · Problem Illustrated
Even power splits assume every traction motor behaves the same.
Wear and operating conditions can make one motor less efficient than another.
Needs · AI Distribution
AI can shift the split toward motors that waste less power.
The model reads per-motor losses and returns a custom allocation.
Project Goal
Run a feasibility study for machine-learning-driven motor power distribution.
No real-world drivetrain dataset, so data is simulated.
Show gains on scenarios the model never saw during training.
Our Approach
Simulate, train, compare.
- 1Simulate train behavior.
- 2Train AI models on generated scenarios.
- 3Compare performance to an even-split baseline.
The Simulation
A virtual locomotive environment creates the learning signal.
Each motor receives a loss factor at scenario start, then the train advances through time.
Model Setup
Input losses. Output power fractions.
The action must distribute all available power without giving everything to one motor.
Positive when the AI beats the even split.
How the AI Learns
Trial, measure, adjust.
Choose a distribution.
Compare efficiency to baseline.
Update the policy toward better choices.
Models Tested
Five candidates covered memory, control, stability, and a supervised baseline.
Why Multiple Models?
Different algorithms fail for different reasons.
Discrete memory vs. continuous control.
Some models handle noisy scenarios better.
Variance and convergence decide whether a result is usable.
State-Based Candidates
Q-Table and SARSA make discrete policy jumps.
- Q-Table remembers best actions for known states.
- SARSA behaves more conservatively.
- Both struggle when smooth control is needed.
Continuous Candidates
DDPG and TD3 treat power allocation like dimmer switches.
- Actor chooses the power setting.
- Critic evaluates the choice.
- TD3 adds a second critic for stability.
Supervised Learning Check
MLP tested whether reinforcement learning was the right paradigm.
Labels were generated from the simulation equations using KKT optimization.
Smarter Training
Parallel learners compare results and keep the best ideas.
Multiple versions of the same model run at once, flushing out weak moves faster.
Training Strategy · Metaheuristic
Segment learning blends local discoveries into a global update.
Learn smaller training segments first.
Blend segment scores like pheromones.
Train globally with old and new information.
Best Performance Model
DDPG generalized after roughly 75 training episodes.
The orange line showed stronger gains, though variance suggests more training would help consistency.
Result Validation
Models were tested against new difficulty levels.
Each difficulty scales motor inefficiency to create easier or harder validation sets.
Final Results · Validation
DDPG consistently achieved the strongest validation results.
- Some models failed to converge.
- Others had too much variance.
- DDPG scaled across easy, medium, and hard cases.
Estimated Financial Impact
Small gains become meaningful across a fleet.
Annual fuel per line-haul locomotive
$600,000Baseline annual fuel cost per locomotive
Validated efficiency gain in a typical variation scenario
$7,440Annual savings per locomotive
Total annual fuel reduction
$7.44MTotal annual cost savings
Thank You
AI-guided power distribution is feasible, measurable, and worth deeper validation.
Next step: move from simulated losses to richer drivetrain data and real operating profiles.