📅 2026-04-16🤖 claude-opus-4
公开数据集参考
用于算法验证、基线对比和申请书可行性论证的公开 BLE/WiFi 定位数据集。
BLE RSSI 数据集
★★★★★ Position-Annotated BLE RSSI Dataset
- 链接:https://www.kaggle.com/datasets/philotuxo/positionannotatedblerssidataset
- 内容:BLE RSSI + 连续 (x, y) 坐标标注,含时间戳
- 规模:~7,500+ 样本
- 价值:最相关——提供连续位置真值,可直接评估非线性最小二乘优化
★★★★★ BLE RSSI Dataset for Indoor Localization
- 链接:https://www.kaggle.com/datasets/mehdimka/ble-rssi-dataset
- Notebook:https://www.kaggle.com/code/somyaas/ble-rssi-dataset-for-indoor-localization
- 内容:13 个 iBeacon 的 RSSI 读数 + 房间/区域分类标签
- 规模:~1,420 样本,14 列
- 价值:BLE RSSI 基准数据集,适合评估分类方法(WKNN/SVM/RF)和指纹法基线
★★★☆☆ BLE Packets from Tracking Devices
- 链接:https://www.kaggle.com/datasets/stefansaxer/ble-packets-from-tracking-devices
- 内容:商用追踪设备(AirTag、Tile 等)的原始 BLE 广播包,含 timestamp、MAC、RSSI、TX power、payload
- 规模:~300K+ 包
- 价值:最接近真实众包场景的原始信号数据,可研究 RSSI 方差、设备异构性、MAC 随机化。无位置标注,需配合其他数据使用
WiFi 指纹数据集(数学结构相同,可验证算法框架)
★★★☆☆ UJIndoorLoc (WiFi Fingerprinting)
- Notebook:https://www.kaggle.com/code/ahmedelabed/indoor-localization-using-wi-fi-fingerprinting
- 内容:520 个 WiFi AP 的 RSSI + 经纬度/楼层/建筑标注
- 规模:~21,048 训练 + 1,111 测试样本
- 价值:规模最大的室内定位公开数据集,WiFi 而非 BLE,但指纹方法论和评估指标(CEP、楼层准确率)直接迁移
ML 建模参考
★★★★☆ ABC Location Model Building
- Notebook:https://www.kaggle.com/code/dihanpranon/abc-location-model-building
- 内容:基于 BLE iBeacon RSSI 数据的 ML 分类模型对比(KNN/SVM/RF)
- 价值:算法对比和超参数调优的教程参考
对申请书的意义
- Position-Annotated BLE RSSI 可作为方案验证的主要数据源——有连续坐标真值
- BLE Packets from Tracking Devices 可用于分析真实设备异构性(AirTag vs Tile vs 其他)
- UJIndoorLoc 可作为大规模压力测试(520 AP、2 万样本)
- 若出题方不提供数据(Q2 待确认),这些数据集可支撑初步实验