Dataset Improvements

I scaled from 10,000 β†’ 40,000 sessions, because 7 days don’t reveal retention patterns.

Main Goals of the day:

  • Scale simulation from 1 week β†’ 4 weeks (28 days)
  • Keep the number of players (1,200)
  • Simulate 40,000 sessions β†’ 33.3 avg sessions per player
  • Model progressive churn decay β€” not just β€œDay 7”, but Week 1 to Week 4

Step by Step

πŸ“ Step 1: Changed start = datetime(2025, 9, 1) β†’ end = datetime(2025, 9, 28)
πŸ“ Step 2: Adjusted Poisson lambda from 8.33 β†’ 33.3
πŸ“ Step 3: Kept n_players = 1200 β€” because real operators don’t get 10K new players every week
πŸ“ Step 4: Modified session duration distribution to reflect weekend spikes and mid-week drop-offs
πŸ“ Step 5: Adjusted bonus claim probability to decay after Week 1 β€” real players stop claiming after 7–10 days

Challenges / Insights

βœ… More time = better insight.

In 7 days, everyone looks like a loyal player.
In 28 days?
β†’ 76% drop after Week 1
β†’ 39% after Week 2
β†’ Only 15% still active at Week 4

That’s not noise.
That’s the real iGaming economy.

I used to think:

β€œI need more players.”

Now I know:

β€œI need more time.”

Because churn doesn’t happen on Day 1.
It happens after the bonus runs out.

Code Snippet Final

``` python
# New parameters
start = datetime(2025, 9, 1)
end = datetime(2025, 9, 28)  # 4 weeks
n_players = 1200
n_sessions = 40000
avg_session = n_sessions / n_players  # 33.3 β€” realistic for active players

´´´

Next Step

I have the data, the model, and the dashboard.

Next: Understanding the strengths and the weaknesses of my work.