GitScrum / Docs
All Best Practices

Data Science Teams | Experiment & Model Tracking

Data science teams need experiment tracking and flexible timelines. GitScrum supports time-boxed research, model versioning, and ML-to-engineering handoffs.

5 min read

Data science teams face unique challenges with iterative experiments, uncertain timelines, and research-heavy work. GitScrum adapts to these needs with flexible workflows, experiment tracking, and visibility into both research progress and production deployments.

Data Science Workflow

Work Categories

DATA SCIENCE TASK TYPES:
β”Œβ”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”
β”‚                                                             β”‚
β”‚ RESEARCH (Exploratory):                                     β”‚
β”‚ β€’ Uncertain outcomes                                       β”‚
β”‚ β€’ Time-boxed, not estimate-driven                          β”‚
β”‚ β€’ Success = learning, not just delivery                    β”‚
β”‚ Example: "Explore NLP approaches for sentiment (2 days)"   β”‚
β”‚                                                             β”‚
β”‚ EXPERIMENT (Hypothesis-driven):                             β”‚
β”‚ β€’ Clear hypothesis to test                                 β”‚
β”‚ β€’ Defined success metrics                                  β”‚
β”‚ β€’ May succeed or fail (both valuable)                      β”‚
β”‚ Example: "Test BERT vs GPT for classification"             β”‚
β”‚                                                             β”‚
β”‚ DEVELOPMENT (Production):                                   β”‚
β”‚ β€’ Traditional development estimation                       β”‚
β”‚ β€’ Build on validated experiments                           β”‚
β”‚ β€’ Clear deliverables                                       β”‚
β”‚ Example: "Implement recommendation API endpoint"           β”‚
β”‚                                                             β”‚
β”‚ MAINTENANCE (Operational):                                  β”‚
β”‚ β€’ Model monitoring and retraining                          β”‚
β”‚ β€’ Data pipeline maintenance                                β”‚
β”‚ β€’ Bug fixes and improvements                               β”‚
β”‚ Example: "Retrain fraud model with Q4 data"               β”‚
β””β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”˜

Experiment Tracking

EXPERIMENT BOARD:
β”Œβ”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”
β”‚ IDEATION     β”‚ ACTIVE      β”‚ ANALYSIS   β”‚ DECISION         β”‚
β”œβ”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”Όβ”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”Όβ”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”Όβ”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€
β”‚              β”‚             β”‚            β”‚                  β”‚
β”‚ Clustering   β”‚ BERT vs GPT β”‚ Feature    β”‚ β†’ Productionize  β”‚
β”‚ approaches   β”‚ comparison  β”‚ selection  β”‚   gradient boost β”‚
β”‚              β”‚             β”‚ results    β”‚                  β”‚
β”‚ Graph-based  β”‚ Gradient    β”‚            β”‚ β†’ Abandon        β”‚
β”‚ recommender  β”‚ boosting    β”‚            β”‚   RNN approach   β”‚
β”‚              β”‚ optimizationβ”‚            β”‚                  β”‚
β”‚ Real-time    β”‚             β”‚            β”‚ β†’ More research  β”‚
β”‚ anomaly      β”‚             β”‚            β”‚   graph approach β”‚
β”‚ detection    β”‚             β”‚            β”‚                  β”‚
β”‚              β”‚             β”‚            β”‚                  β”‚
β””β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”΄β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”΄β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”΄β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”˜

Adapting Agile

Sprint Planning

DATA SCIENCE SPRINT STRUCTURE:
β”Œβ”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”
β”‚ 2-WEEK SPRINT                                               β”‚
β”œβ”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€
β”‚                                                             β”‚
β”‚ ALLOCATION GUIDELINES:                                      β”‚
β”‚ β€’ 60% Committed work (production, maintenance)             β”‚
β”‚ β€’ 30% Experiments (time-boxed research)                    β”‚
β”‚ β€’ 10% Learning (papers, tools, upskilling)                 β”‚
β”‚                                                             β”‚
β”‚ SPRINT EXAMPLE:                                             β”‚
β”‚                                                             β”‚
β”‚ COMMITTED (60%):                                            β”‚
β”‚ β€’ Deploy recommendation model v2.3                         β”‚
β”‚ β€’ Fix data pipeline timeout issue                          β”‚
β”‚ β€’ Document model training process                          β”‚
β”‚                                                             β”‚
β”‚ EXPERIMENTS (30%):                                          β”‚
β”‚ β€’ Compare BERT vs GPT-2 for classification (3 days)        β”‚
β”‚   Success: Determine which performs better                 β”‚
β”‚ β€’ Explore graph features for fraud detection (2 days)      β”‚
β”‚   Success: Identify promising signals                      β”‚
β”‚                                                             β”‚
β”‚ LEARNING (10%):                                             β”‚
β”‚ β€’ Review recent papers on transformer efficiency           β”‚
β”‚ β€’ Explore new MLOps tooling                                β”‚
β””β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”˜

Estimation Approach

ESTIMATION BY WORK TYPE:
β”Œβ”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”
β”‚                                                             β”‚
β”‚ RESEARCH/EXPERIMENTS:                                       β”‚
β”‚ Use TIME-BOXING:                                            β”‚
β”‚ "Spend 2 days exploring this. Report findings."            β”‚
β”‚ NOT: "Estimate how long to find a solution."               β”‚
β”‚                                                             β”‚
β”‚ Typical time boxes:                                         β”‚
β”‚ β€’ Quick spike: 4 hours                                     β”‚
β”‚ β€’ Standard experiment: 2-3 days                            β”‚
β”‚ β€’ Deep research: 1 week                                    β”‚
β”‚                                                             β”‚
β”‚ PRODUCTION DEVELOPMENT:                                     β”‚
β”‚ Use STORY POINTS:                                           β”‚
β”‚ β€’ Clear requirements                                       β”‚
β”‚ β€’ Known technology                                         β”‚
β”‚ β€’ Comparable to past work                                  β”‚
β”‚                                                             β”‚
β”‚ HANDLING UNCERTAINTY:                                       β”‚
β”‚ Phase 1: Explore (time-boxed) β†’ Learning                   β”‚
β”‚ Phase 2: Prototype (rough estimate) β†’ Working code         β”‚
β”‚ Phase 3: Productionize (firm estimate) β†’ Deployed          β”‚
β””β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”˜

Model Development Workflow

Model Lifecycle

MODEL DEVELOPMENT STAGES:
β”Œβ”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”
β”‚                                                             β”‚
β”‚ PROBLEM DEFINITION                                          β”‚
β”‚ β”‚ β€’ Business problem clear                                 β”‚
β”‚ β”‚ β€’ Success metrics defined                                β”‚
β”‚ β”‚ β€’ Data availability confirmed                            β”‚
β”‚ β–Ό                                                          β”‚
β”‚ DATA EXPLORATION                                            β”‚
β”‚ β”‚ β€’ Understand data quality                                β”‚
β”‚ β”‚ β€’ Identify features                                      β”‚
β”‚ β”‚ β€’ Baseline established                                   β”‚
β”‚ β–Ό                                                          β”‚
β”‚ MODEL EXPERIMENTATION                                       β”‚
β”‚ β”‚ β€’ Try multiple approaches                                β”‚
β”‚ β”‚ β€’ Track experiments systematically                       β”‚
β”‚ β”‚ β€’ Select best performer                                  β”‚
β”‚ β–Ό                                                          β”‚
β”‚ MODEL DEVELOPMENT                                           β”‚
β”‚ β”‚ β€’ Production-ready code                                  β”‚
β”‚ β”‚ β€’ Testing and validation                                 β”‚
β”‚ β”‚ β€’ Documentation                                          β”‚
β”‚ β–Ό                                                          β”‚
β”‚ DEPLOYMENT                                                  β”‚
β”‚ β”‚ β€’ API/batch integration                                  β”‚
β”‚ β”‚ β€’ Monitoring setup                                       β”‚
β”‚ β”‚ β€’ A/B testing if applicable                              β”‚
β”‚ β–Ό                                                          β”‚
β”‚ MONITORING & ITERATION                                      β”‚
β”‚   β€’ Track model performance                                β”‚
β”‚   β€’ Detect drift                                           β”‚
β”‚   β€’ Plan retraining                                        β”‚
β””β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”˜

Team Collaboration

DATA SCIENCE + ENGINEERING HANDOFF:
β”Œβ”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”
β”‚                                                             β”‚
β”‚ DATA SCIENCE DELIVERS:                                      β”‚
β”‚ βœ“ Trained model artifact                                   β”‚
β”‚ βœ“ Model card (performance, limitations)                    β”‚
β”‚ βœ“ Feature requirements                                     β”‚
β”‚ βœ“ Expected input/output formats                            β”‚
β”‚ βœ“ Performance benchmarks                                   β”‚
β”‚                                                             β”‚
β”‚ ENGINEERING PROVIDES:                                       β”‚
β”‚ βœ“ Feature pipeline infrastructure                          β”‚
β”‚ βœ“ Model serving platform                                   β”‚
β”‚ βœ“ Monitoring and alerting                                  β”‚
β”‚ βœ“ A/B testing framework                                    β”‚
β”‚ βœ“ Scaling and reliability                                  β”‚
β”‚                                                             β”‚
β”‚ SHARED RESPONSIBILITIES:                                    β”‚
β”‚ β€’ Integration testing                                      β”‚
β”‚ β€’ Performance optimization                                 β”‚
β”‚ β€’ Incident response                                        β”‚
β”‚ β€’ Documentation                                            β”‚
β””β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”˜

Related Solutions