Analyzing the health of Philadelphia's bike share system


Published on August 16, 2015 by Dr. Randal S. Olson

bike share Indego machine learning modeling Philadelphia

4 min READ


Last month, I wrote about my initial attempts to model and predict the usage patterns of Indego, Philadelphia's new bike share system. To recap:

If you've ever used a bike share before, you know that one of the biggest fears is coming up to an empty bike share station when you need a bike. (Or similarly, coming up to a full station when you need to drop a bike off.) To help abate those fears, I've started monitoring the Indego bike usage API to see if I could model and predict when the bike share stations are most likely going to be empty or full.

A couple weeks after my post, another team of data analysts made an interesting suggestion to the bike sharers in Washington, D.C.: If you run into an empty or full bike share station, just wait 5 minutes and there should be an opening for you. Since I had the Indego data on handa, I decided to take a look at Philadelphia's bike share to see if the same advice holds here.

Below, I analyzed each Indego station individually and plotted the percentage of the time each station recovered from becoming empty or full within 5 minutes. Each dot represents a station; if you hover over one of the dots, the plot will tell you the name of the station, the percents, and the number of times each station became empty or full since July '15.

The takeaway from this plot is that if you see an empty or full Indego bike station, you're better off finding another station than waiting 5 minutes. In fact, in the past couple months you typically had to wait about 30 minutes at a station before it was no longer full or empty -- with shorter wait times during peak hoursb, of course.

Thankfully, most of the Indego bike stations are quite healthy. Below, I plotted the breakdown of how often each station is healthy (not empty nor full), empty, or full and sorted the stations by how the percentage of time they're healthy.

indego-pct-healthy-bar

The majority of the Indego bike stations have an uptime of 90% or higher, with the station at 18th & Fernon topping the list with a near-100% uptime. By far, the biggest problem Indego bike stations face is running out of bikes during the peak hours. The Children's Hospital of Philadelphia (CHOP) station, for example, frequently runs out of bikes in the evening when the hospital workers are heading back home. Similarly, commuting stations in the residential neighborhoods of Philadelphia -- such as 18th & Washington Ave -- have a strong tendency to fill up shortly after work hours.

To provide a better perspective on the Indego bike station downtimes, I plotted each station below. Again, each dot represents a station and you can hover over the dot to see the station names and statistics. The better-performing stations are on the bottom left of the chart, with near-zero downtimes.

Again, the critical issue of running out of bikes is highlighted in this chart: Some stations, such as 8th & Market and 18th & JFK, spend upwards of 25% of their time without a single bike. It's fair to assume that most of this downtime is during the evenings and overnight, but it's an alarming statistic nonetheless.

Despite these difficulties, I think the above findings are a great sign for the future of our burgeoning bike share program. There's currently more demand for the bike share than it can sometimes provide, which means there's tremendous opportunity for bike share growth in Philadelphia.

In the meantime, if you'd like to get the upper hand on the other bike sharers and find out the best times to take your bike routes, feel free to analyze my cache of the Indego usage data below.


Technical notes

a I have made my cache of the Indego bike share data set available online here. (Note: Sorry, I had to take down the cache, but I've left this text here for historical records.) The data is in tab-separated format, compressed with gzip, and contains all of the data from the API, captured every 5 minutes. The cache auto-updates every 5 minutes after the hour, every hour. Please don't send requests for the cache more than one time per hour; I will have to take it down if my web site slows down due to high volumes of requests.

b The peak usage hours for the Indego bike share are typically 8-10 AM and 4-6 PM ET on the weekdays. You can read more about the Indego usage patterns in my previous post.