Please also check this folder intermediate_files
for the intermediate data files and code for data preprocessing.
- get_link_info_feature_beijing.py: for each link, extract link info feature;
- get_query_distribution_feature_beijing_1km_seq.py: get query distribution feature (algorithm 1);
- get_query_info_beijing.py: get query information;
- new_anomalty1109.py: event discovery algorithm;
- query_d.py: import the query data into the dict;
- s_grid_and_d_grid.py: import the query data into the source dict and destination dict, respectively;
- filter_around_link_set_beijing_1km.py: filter the link set so that each link has the traffic data;
- filter_around_traffic_beijing.py: filter the link with the ratio of the data completion (true, >90%; false, otherwise);
- get_around_traffic_beijing_mv_avg_1km.py: import the traffic data from text files, and use moving average to smooth it and dump the traffic data to pkl;