eBay 2025 University Machine Learning Competition

Overview

This project documents our submission for the eBay 2025 University Machine Learning Competition. The challenge focuses on a Named Entity Recognition (NER) task where models learn from marketplace listing titles to identify and label key product attributes to enhance shopping and discovery in German language.

For the full task definition, rules, and datasets, see the official challenge page: eBay 2025 University ML Competition on EvalAI

Our Approach (Top 4)

Due to the competition’s data usage agreement and model disclosure restrictions, we are unable to share implementation details, dataset-specific processing steps, or proprietary configurations. The description below intentionally remains high-level while reflecting the overall structure of our work.

Our solution was developed through an iterative, system-level process rather than a single fixed design. We treated the task as a learning and refinement problem, where modeling choices were continuously guided by empirical performance and error behavior.

Model Exploration

We evaluated a diverse set of open-source, pretrained German language models suitable for sequence labeling tasks. Instead of relying on a single architecture, we explored multiple representations to understand how different pretrained linguistic priors affected downstream performance.

Iterative Fine-Tuning

Rather than applying a single training recipe, we explored multiple fine-tuning strategies and training configurations, including cross-validation-based training, progressive fine-tuning, validation-based early stopping, and validation-driven model selection. These variations were assessed through controlled validation, with adjustments driven by observed failure patterns rather than static assumptions.

System-Level Optimization

We treated preprocessing, model training, and output refinement as a unified pipeline, where improvements at one stage could influence the behavior of the others. This allowed us to gradually stabilize predictions and reduce inconsistent outputs without relying on dataset-specific shortcuts.

Through repeated cycles of evaluation and adjustment—including model selection, training configuration changes, and output-level refinement—the system converged to a stable configuration that balanced precision and generalization.

Using this system-oriented strategy, our team, Init to Win, was able to achieve a Top 4 ranking with an F1 score of 0.940146.

Overview

Our Approach (Top 4)

Model Exploration

Iterative Fine-Tuning

System-Level Optimization

Refinement and Convergence