Pharmaceutical wholesaler

Training a decision model for churn prediction

Starting point

  • 600 million data records of sales activities and purchase transactions
  • 26 tables, ~3,700 attributes
  • Churn definition:
    • Manually marked (notification)
    • Revenue decline > 40%"

Project result

  • Decision model with multiple components and an overall model accuracy of 86.49%

Role in the project

  • Analysis of the data base using statistical methods and domain expertise
    • Text mining on sales reports
    • Time series analysis (trends)
    • Classification on customer master data

Customer benefits

  • Identification of potential churn candidates
  • Enabling the initiation of countermeasures
  • Foundation for further analyses (more domain expertise)
  • Insights into current data quality and the need for data management
Jens Schnettler | Member of Executive Board