Booking.com - Search & Ranking at Scale

Project Summary

Type: Enterprise Role (Senior Engineer)
Focus: Search & Ranking at Scale

Key Features:

Search and ranking improvements A/B tested across millions of users
Fixed critical data pipeline bug restoring ranking quality for entire platform
Migrated Spark workflows from on-premise to cloud infrastructure
Worked on core search engine serving millions of daily property searches
Improved filtering and relevance for home properties

As a Senior Engineer on Booking.com's Search & Ranking team, I worked on the core search engine that powers property discovery for millions of daily users globally. The challenge: continuously optimize ranking algorithms to improve user experience while maintaining data pipeline reliability at massive scale. The solution: rigorous A/B testing, robust data pipeline engineering, and cloud migration for improved scalability and reliability.

The Problem

Serving millions of daily property searches requires constantly optimizing ranking algorithms while maintaining data pipeline reliability at massive scale:

Millions of users depend on accurate, relevant search results every day
Ranking quality directly impacts business metrics—poor results mean lost bookings
Data pipeline reliability critical—any downtime or data quality issues degrade search quality
A/B testing at scale needed to validate improvements without risking user experience
Legacy infrastructure limitations require migration to cloud for better scalability
Complex data workflows processing massive datasets for feature engineering and model training

A single bug in the data pipeline can cause ranking quality to degrade across the entire platform, affecting millions of users and millions in revenue.

Architecture

flowchart TB
    subgraph search [Search Flow]
        query[User Query<br/>Millions Daily]
        search_engine[Search Engine<br/>Property Discovery]
        ranking_model[Ranking Model<br/>Relevance Scoring]
        ab_testing[A/B Testing Layer<br/>Experiment Framework]
        results[Search Results<br/>Optimized Rankings]
    end

    subgraph data_pipeline [Data Pipeline]
        raw_data[Raw Data<br/>Property Listings<br/>User Behavior]
        spark_processing[Spark Processing<br/>Feature Engineering]
        feature_store[(Feature Store<br/>Model Features)]
        model_training[Model Training<br/>Ranking Algorithms]
    end

    subgraph infrastructure [Infrastructure]
        on_premise[On-Premise<br/>Legacy Systems]
        cloud[Cloud Infrastructure<br/>Scalable & Reliable]
    end

    query --> search_engine
    search_engine --> ranking_model
    ranking_model --> ab_testing
    ab_testing --> results

    raw_data --> spark_processing
    spark_processing --> feature_store
    feature_store --> ranking_model
    feature_store --> model_training
    model_training --> ranking_model

    spark_processing -.->|Migration| cloud
    on_premise -.->|Migrated| cloud

Technical Approach

Search & Ranking Optimization

Worked on improving the core search and ranking algorithms that determine which properties users see:

A/B tested search improvements for home properties, improving filtering and relevance
Analyzed user behavior data to understand what makes results relevant
Iterated on ranking signals to better match user intent with property listings
Optimized filtering logic to reduce noise and improve result quality
Measured impact through rigorous experimentation and metrics analysis

The A/B testing framework allowed us to validate improvements safely, rolling out changes gradually and measuring impact on key metrics before full deployment.

Data Pipeline Engineering

Identified and fixed a critical Spark workflow bug that was causing an unrated properties surge—restoring quality ranking predictions across the platform:

Root cause analysis of data pipeline issues affecting ranking quality
Debugged complex Spark workflows processing massive datasets
Fixed data quality issues that were causing properties to be incorrectly rated or excluded
Restored ranking quality for the entire platform by fixing upstream data problems
Improved monitoring to catch similar issues earlier in the future

This bug fix had immediate impact—restoring proper ranking quality meant users saw more relevant results, directly improving user experience and business metrics.

A/B Testing at Scale

Designed and executed A/B tests across millions of users:

Experiment design ensuring statistical significance while minimizing risk
Traffic allocation strategies to test improvements safely
Metrics tracking across user engagement, booking conversion, and revenue
Rollout strategies for gradual deployment based on test results
Analysis frameworks to understand why changes worked or didn't work

The A/B testing infrastructure enabled rapid iteration on search improvements while maintaining platform stability and user experience.

Cloud Migration

Migrated on-premise Spark workflows to cloud infrastructure:

Assessed legacy systems to understand dependencies and migration requirements
Designed cloud architecture for improved scalability and reliability
Migrated Spark workflows with zero downtime
Improved system reliability through cloud-native features (auto-scaling, managed services)
Reduced operational overhead by leveraging cloud infrastructure management

The migration improved system reliability and scalability while reducing operational burden, enabling the team to focus on building features rather than managing infrastructure.

Results

Metric	Impact
Search improvements	A/B tested across millions of users with measurable improvements
Data pipeline bug fix	Restored ranking quality for entire platform
Cloud migration	Improved reliability and scalability of Spark workflows
Home property search	Improved filtering and relevance through targeted optimizations
Platform scale	Millions of daily searches served reliably

Working at one of the world's most data-driven companies, every change was measured and validated through rigorous experimentation. The improvements I contributed to directly impacted millions of users and millions in revenue.

Tech Stack

Java Apache Spark Cloud Infrastructure A/B Testing Data Pipelines

Key Learnings

At Booking.com's scale, small improvements to search relevance compound across millions of users and millions in revenue. The most impactful work wasn't just building new features—it was fixing critical bugs that degraded user experience across the entire platform. The data pipeline bug fix taught me that ranking quality is only as good as the data feeding it. Cloud migration showed how infrastructure improvements enable faster feature development and better reliability. The rigorous A/B testing culture ensured that every change was validated with real user data before deployment—a practice that should be standard at any data-driven company.

Need help with search, ranking, or data pipelines at scale?

I help companies build scalable search systems and reliable data pipelines. Let's discuss your challenges.

Book Free Intro Call