📅 2026-04-16🤖 claude-opus-4problem1researchdatasetskaggle

公开数据集参考

用于算法验证、基线对比和申请书可行性论证的公开 BLE/WiFi 定位数据集。


BLE RSSI 数据集

★★★★★ Position-Annotated BLE RSSI Dataset

  • 链接https://www.kaggle.com/datasets/philotuxo/positionannotatedblerssidataset
  • 内容:BLE RSSI + 连续 (x, y) 坐标标注,含时间戳
  • 规模:~7,500+ 样本
  • 价值最相关——提供连续位置真值,可直接评估非线性最小二乘优化 x^=arg minωi(f(ri)xIi)2\hat{x} = \argmin \sum \omega_i(f(r_i) - |x - I_i|)^2

★★★★★ BLE RSSI Dataset for Indoor Localization

★★★☆☆ BLE Packets from Tracking Devices

  • 链接https://www.kaggle.com/datasets/stefansaxer/ble-packets-from-tracking-devices
  • 内容:商用追踪设备(AirTag、Tile 等)的原始 BLE 广播包,含 timestamp、MAC、RSSI、TX power、payload
  • 规模:~300K+ 包
  • 价值:最接近真实众包场景的原始信号数据,可研究 RSSI 方差、设备异构性、MAC 随机化。无位置标注,需配合其他数据使用

WiFi 指纹数据集(数学结构相同,可验证算法框架)

★★★☆☆ UJIndoorLoc (WiFi Fingerprinting)


ML 建模参考

★★★★☆ ABC Location Model Building


对申请书的意义

  • Position-Annotated BLE RSSI 可作为方案验证的主要数据源——有连续坐标真值
  • BLE Packets from Tracking Devices 可用于分析真实设备异构性(AirTag vs Tile vs 其他)
  • UJIndoorLoc 可作为大规模压力测试(520 AP、2 万样本)
  • 若出题方不提供数据(Q2 待确认),这些数据集可支撑初步实验