4.3. Logistic regression models for non-captive stations
It can be seen from Table 7 that the correlation between distance and travel time, travel mode and time, purpose and InboundOut and InBoundOut and travelFeeD are 0.5, _0.45, _0.5 and _0.35 respectively. Therefore, travel time and InboundOut were removed for model selection. Three variables were identified to be significant from the best fitting logistic regression model for non-captive stations (see Table 8). There are 559 records for the non-captive stations (Table 2), but the sample size for this regression model is 486 with 73 missing records being removed for the purpose of the analysis. The most influential variable is travel cost (from a chosen station to a destination). The less the cost of travelling from a chosen station to a destination, the more likely a chosen station will be a non-nearest station, which is consistent with the results from the overall model. However, different from the model for all chosen station, cost (origin to station) was found to be significant. The less the cost from origin to the chosen station, the less likely it is that chosen stations will be the nearest station.