
Experimented with Azure Machine Learning to build a modular A/B testing pipeline that predicts UI performance and accelerates iteration for B2B products
🚫 The Problem with Traditional A/B Testing
High Cost: Enterprise A/B platforms can cost $5K–$10K per month
Slow Iteration: A single test may take 2+ weeks to complete
Data Demands: Requires large samples to reach significance
🤖 The Advantage of AI-Powered Testing (via Azure ML)
Data-Efficient: Delivers accurate predictions from small datasets
Historical Forecasting: Leverages past engagement to model performance
Lower Overhead: Reduces dependency on full-scale live experiments
✅ The Product Impact
Faster, Smarter Design Decisions: Predicts top-performing UI variants before launch
Actionable Insights: Enables confident product choices—without waiting
Scalable for B2B: Ideal for teams with limited traffic, budget, or time constraints

Pipeline flow architecture

Azure Machine Learning Studio Pipeline

“Split Data” component was configured so that 70% of the data is used for training purpose,
while the 30% of the data is used to test how well my model functions.

In the “Train Model” step, I specified the label column “Conversion Rate”
in the imported CSV file as the column used to train the model.

In the “Train Model” step, I specified the label column “Conversion Rate”
in the imported CSV file as the column used to train the model.
✅ Version A is a better choice as it outperformed Version B, showing higher conversions and greater predicted user engagement.
Conversion Rate (Avg):
0.1108 (Version A) vs. 0.1009 (Version B)
→ +9.8% improvement in conversions for Version A
Avg Scored Label (Predicted Engagement):
0.1270 (Version A) vs. 0.1062 (Version B)
→ Users are predicted to be more engaged with Version A

The successful pipeline gave the above results for the “Score Model” component,
including the predicted engagement (Scored Labels) of the Conversion Rate column

Connected Azure ML outputs to Power BI to visualize model results as a graph —
it shows that Version A outperformed Version B in both predicted engagement and conversion rate