다운로드가 가능한 정답셋이 있는(labeling 된) 공개 데이터셋 중에서, 신뢰성이 높으며 비즈니스케이스 활용 가능한 학습데이터
1) HTTP CSIC 2010 Dataset for Intrusion detection (Security) - http://www.isi.csic.es/dataset/
2) Multi-Source Cyber-Security Events Dataset (Security) - http://csr.lanl.gov/data/cyber1/
3) Air Quality Dataset (Public sector) - http://archive.ics.uci.edu/ml/datasets/Air+Quality#
4) Gas Sensors for Home activity monitoring Dataset (Smart Home) - https://github.com/thmosqueiro/ENose-Decorr_Humdt_Temp
5) Bank Marketing Dataset (Marketing, Retail) - http://archive.ics.uci.edu/ml/datasets/Bank+Marketing#
6) Human Activity Recognition using smartphones Dataset (Marketing, Retail) - http://archive.ics.uci.edu/ml/datasets/Smartphone-Based+Recognition+of+Human+Activities+and+Postural+Transitions
7) Credit Card Client in Taiwan (6 months) Dataset (Marketing, Finance) - http://archive.ics.uci.edu/ml/datasets/default+of+credit+card+clients
8) Online Retail Dataset (Marketing, e-Commerce) - http://archive.ics.uci.edu/ml/datasets/Online+Retail
9) MIMIC (Medical database) - https://github.com/MIT-LCP/mimic-code / https://mimic.physionet.org/about/mimic/
• Health-related data associated with over 40k patients who stayed in critical care units of Beth Israel Deaconess Medical Center 2001-2012.
• Includes information about demographics, vital sign measurements (-1 data point per hour), lab test result, procedures, medications, caregiver notes, imaging reports, and mortality.
'Biusiness Insight > Data Science' 카테고리의 다른 글
Confusion matrix와 주요 성능지표 산출식 (0) | 2017.05.05 |
---|---|
[TensorFlow] 텐서플로우 관련 유용한 링크 (0) | 2017.03.12 |
Data Science & Machine Learning 관련 Coursera 추천 강의 리스트 (0) | 2016.03.20 |
IBM Watson 따라잡기 (0) | 2016.02.26 |
Artificial intelligence & Deep Learning (인공지능과 딥러닝) (0) | 2015.04.12 |