Skip to content

neighbor-ryan/ctbk.dev

Repository files navigation

ctbk.dev Citi Bike Dashboard

  • ctbk/ contains a Python library and CLI (ctbk) that derives various datasets from Citi Bike's public data at s3://tripdata
  • s3://ctbk contains cleaned, public data output by ctbk
  • www/ contains the static web app served at ctbk.dev.
  • GitHub Actions in .github/workflows:
    • poll for new Citi Bike data at the start of each month
    • compute new derived data when found, and
    • build and publish the ctbk.dev website

Screenshots

ctbk.dev:

Screenshot of dashboard; per-month ride counts going back to June 2013, with a 12mo rolling avg showing mostly steady growth

Station/Ridership Map

ctbk.dev/stations:

Map of Citi Bike stations, Hoboken NJ Transit Terminal selected, showing destinations for rides beginning there

JC & Hoboken Only

ctbk.dev/?r=jh:

Screenshot of dashboard; per-month ride counts for Jersey City and Hoboken only, going back to June 2013, with a 12mo rolling avg showing mostly steady growth

Ride Minute %'s by Gender

ctbk.dev?d=1406-2102&g=mf&pct&s=g&y=m:

(Gender labels stopped appearing in the data in February 2021)

Ride %'s by User Type

ctbk.dev/?s=u&pct=:

Total Classic / E-bike Ride Minutes

ctbk.dev?y=m&s=b&rt=ce:

Stations by launch date

This is a work-in-progress; red = newer, yellow = older:

E-bike minutes by user type

ctbk.dev/?y=m&s=u&rt=e:

You can get some interesting upper bounds on e-bike fee revenue from this:

  • 6MM "member" e-bike minutes @ $0.16/min ⟹ ≈$1MM
  • 3MM "casual" e-bike minutes @ $0.26/min ⟹$750k

That doesn't count various reasons that e-bike minutes end up being free (positive bike angel points for the ride, no classic bikes available at station, etc.). With the discussion around recent price increases, you can do some envelope math like:

  • Suppose an e-bike costs $1000 (probably a low-ball estimate)
  • Suppose 10 rides per e-bike per day, and avg. 15mins / ride ⟹ $30 in e-bike fees per e-bike per day ⟹ 1-2 months to break even on each e-bike, before accounting for any operational cost of maintaining the e-bike fleet. Most likely the bikes don't break even for years, I'd guess…

Ride Durations

This isn't live anywhere yet, but:

The peak of the distribution (rounded down to the nearest 10s) is 4m20s. See notebook here.

Prior Art

Many great analyses of Citi Bike data have been done over the years!

My hope is that this dashboard will:

  • stay up to date automatically
  • support enough exploratory data analysis and visualization to answer most q's a layperson might have about system-wide stats

Feedback / Contributing

Feel free to file an issue here with any comments, bug reports, or feedback!