aBeacon Data Release

Large scale mobile sensing network is at the intersection of multiple research communities, including mobile computing, networked systems design and implementation, and spatial-temporal data mining. However, the progress of studies on large-scale mobile sensing networks does not meet industrial expectations. One of the main reasons is that there is no well-organized dataset that provides large amounts of sensing and mobility data from multiple channels to support related studies. Motivated by the observation, we release aBeacon: Alibaba Beacon System of Couriers’ Arrival Detection: a large-scale repository composed of BLE sensing, location trace, and manual report data, including 31,131 couriers at 2,466 merchant locations in one month. More details on the aBeacon system can be found in our NSDI’21 paper "From Conception to Retirement: a Lifetime Story of a 3-Year-Old Operational Wireless Beacon System in the Wild”.

Potential Topics:

  1. For aBeacon sensing data, we envision the following research topics:
    1. Bluetooth network topology;
    2. Impact of device type on Bluetooth monitoring;
    3. Relation between Bluetooth-based detection and courier report (with courier location report data);
    4. Relation between Bluetooth-based detection and GPS (with courier location trace data);
  2. For courier location trace data, we envision the following research topics:
    1. Courier Mobility Study (Cluster analysis/Destination prediction);
    2. Differences between e-scooter mobility and other transportation types (with other data-sets);
    3. Location privacy;
  3. For courier location report data, we envision the following research topics:
    1. Order scheduling strategy study;
    2. Order scheduling system efficiency (with courier location trace data);


  1. Ding Yi, Ling Liu, Yu Yang, Yunhuai Liu, Tian He, Desheng Zhang.
    From Conception to Retirement: a Lifetime Story of a 3-Year-Old Operational Wireless Beacon System in the Wild.
    In USENIX NSDI 2021.
  2. Yu Yang, Ding Yi, D.Yuan, Guang Wang, Xiaoyang Xie, Yunhuai Liu, Tian He and Desheng Zhang.
    TransLoc: Transparent Indoor Localization with Uncertain Human Participation.
    In ACM MobiCom'20.

ETC Data Release

This dataset contains ETC transaction samples for one day in Guangdong Province including: origin, destination, origin time, destination time, plate id. It includes 1,531,863 of vehicles and 2,515,672 of records. This dataset is for academic research only. All rights reserved. For privacy concerns, all identifiable IDs have been replaced by serial numbers.

We have been negotiating with service providers in Shenzhen to publish more data safely. But due to security and privacy concerns, some data used in the paper cannot be made public currently. Please follow our future work. Please cite the following paper when using this dataset.



  1. Yu Yang, Xaoyang Xie, Zhihan Fang, Fan Zhang, Yang Wang, Desheng Zhang.
    VeMo: Enabling Transparent Vehicular Mobility Modeling at Individual Levels with Full Penetration.
    In ACM MobiCom'19.
  2. Yu Yang, Fan Zhang, and Desheng Zhang.
    SharedEdge: GPS-free fine-grained travel time estimation in state-level highway systems.
    In ACM IMWUT/UbiComp'18.