合集:行业AI数据集精选
本文精选临床医学领域最受欢迎90+机器学习数据集,这些数据集来自具有重要影响力的学会、会议、数据库、期刊、国内外AI竞赛组织方、Github和Kaggle等数据集托管方。
一、医学组织
获取医疗行业众多具有重要影响力的学会、会议、数据库和期刊信息。
二、临床医学领域机器学习数据集
1. Pima Indians Diabetes Database
- 星标数: ⭐ 4,997
- 简介: 根据诊断指标预测糖尿病的发病情况。
- 主题: india, healthcare, earth and nature, health, diabetes
- 协议: CC0: Public Domain 所有者: UCI Machine Learning
- 链接: https://kaggle.com/datasets/uciml/pima-indians-diabetes-database
2. Breast Cancer Wisconsin (Diagnostic) Data Set
- 星标数: ⭐ 4,010
- 简介: 预测癌症是良性还是恶性
- 主题: healthcare, cancer
- 协议: CC BY-NC-SA 4.0 所有者: UCI Machine Learning
- 链接: https://kaggle.com/datasets/uciml/breast-cancer-wisconsin-data
3. Stroke Prediction Dataset
- 星标数: ⭐ 3,519
- 简介: 预测卒中事件的11项临床特征
- 主题: healthcare, public health, health, binary classification, health conditions
- 协议: Data files © Original Authors 所有者: fedesoriano
- 链接: https://kaggle.com/datasets/fedesoriano/stroke-prediction-dataset
4. Medical Cost Personal Datasets
- 星标数: ⭐ 3,159
- 简介: 使用线性回归进行保险预测
- 主题: healthcare, education, finance, health, insurance
- 协议: Database: Open Database, Contents: Database Contents 所有者: Miri Choi
- 链接: https://kaggle.com/datasets/mirichoi0218/insurance
5. Heart Failure Prediction Dataset
- 星标数: ⭐ 3,140
- 简介: 预测心脏病事件的11项临床特征。
- 主题: healthcare, health, classification, health conditions, heart conditions
- 协议: Database: Open Database, Contents: © Original Authors 所有者: fedesoriano
- 链接: https://kaggle.com/datasets/fedesoriano/heart-failure-prediction
6. Heart Failure Prediction
- 星标数: ⭐ 2,477
- 简介: 预测死亡事件的12项临床特征。
- 主题: healthcare, public health, health, tabular, heart conditions
- 协议: Attribution 4.0 International (CC BY 4.0) 所有者: Larxel
- 链接: https://kaggle.com/datasets/andrewmvd/heart-failure-clinical-data
7. Cardiovascular Disease dataset
- 星标数: ⭐ 1,389
- 简介: 该数据集包含70,000条患者数据记录,涵盖11个特征变量及1个目标变量。
- 主题: healthcare, health, heart conditions
- 协议: Unknown 所有者: Svetlana Ulianova
- 链接: https://kaggle.com/datasets/sulianova/cardiovascular-disease-dataset
8. Medical Appointment No Shows
- 星标数: ⭐ 1,104
- 简介: 为什么有30%的患者会错过预约时间?
- 主题: brazil, healthcare, public health, health
- 协议: CC BY-NC-SA 4.0 所有者: JoniHoppen
- 链接: https://kaggle.com/datasets/joniarroba/noshowappointments
9. COVID-19 Dataset
- 星标数: ⭐ 877
- 简介: 新冠肺炎患者的症状、状况及既往病史。
- 主题: diseases, health, classification, covid19
- 协议: CC0: Public Domain 所有者: Meir Nizri
- 链接: https://kaggle.com/datasets/meirnizri/covid19-dataset
10. mimic3-benchmarks
- 星标数: ⭐ 876
- 简介: 用于从MIMIC-III临床数据库构建基准机器学习数据集的Python套件。💊
- 主题: benchmark, clinical-data, deep-learning, machine-learning
- 协议: MIT License 所有者: YerevaNN
- 链接: https://github.com/YerevaNN/mimic3-benchmarks
11. Diabetes prediction dataset
- 星标数: ⭐ 851
- 简介: 基于医疗与人口统计数据的糖尿病预测综合数据集
- 主题: healthcare, health, classification, binary classification, diabetes
- 协议: Data files © Original Authors 所有者: Mohammed Mustafa
- 链接: https://kaggle.com/datasets/iammustafatz/diabetes-prediction-dataset
12. Health Insurance Marketplace
- 星标数: ⭐ 716
- 简介: 探索美国医疗保险市场中健康与牙科计划的数据
- 主题: healthcare, dentistry, earth and nature, business, economics
- 协议: CC0: Public Domain 所有者: US Department of Health and Human Services
- 链接: https://kaggle.com/datasets/hhs/health-insurance-marketplace
13. Fetal Health Classification
- 星标数: ⭐ 684
- 简介: 利用CTG数据将胎儿健康状况分类为正常、可疑或病理状态。
- 主题: healthcare, public health, health, mortality, tabular
- 协议: Other (specified in description) 所有者: Larxel
- 链接: https://kaggle.com/datasets/andrewmvd/fetal-health-classification
14. Breast Cancer Dataset
- 星标数: ⭐ 656
- 简介: 乳腺癌类型的二元分类预测
- 主题: healthcare, classification, tabular, binary classification, cancer
- 协议: CC0: Public Domain 所有者: M Yasser H
- 链接: https://kaggle.com/datasets/yasserh/breast-cancer-dataset
15. Heartbeat Sounds
- 星标数: ⭐ 605
- 简介: 从听诊器音频中分类心跳异常
- 主题: music, healthcare, earth and nature, health, classification
- 协议: CC0: Public Domain 所有者: Ed King
- 链接: https://kaggle.com/datasets/kinguistics/heartbeat-sounds
16. Cervical Cancer Risk Classification
- 星标数: ⭐ 595
- 简介: 癌症指标预测;请下载;运行内核并点赞
- 主题: healthcare, genetics, cancer
- 协议: Other (specified in description) 所有者: Gokagglers
- 链接: https://kaggle.com/datasets/loveall/cervical-cancer-risk-classification
17. Respiratory Sound Database
- 星标数: ⭐ 561
- 简介: 利用音频记录检测呼吸系统疾病。
- 主题: healthcare, earth and nature, biology, health, multiclass classification
- 协议: Unknown 所有者: vbookshelf
- 链接: https://kaggle.com/datasets/vbookshelf/respiratory-sound-database
18. Logistic regression To predict heart disease
- 星标数: ⭐ 551
- 简介: 心脏病预测
- 主题: healthcare, health, logistic regression, regression, health conditions
- 协议: Unknown 所有者: Dileep
- 链接: https://kaggle.com/datasets/dileep070/heart-disease-prediction-using-logistic-regression
19. Disease Symptom Prediction
- 星标数: ⭐ 526
- 简介: 有助于构建疾病预测或医疗保健系统
- 主题: healthcare, diseases, health, classification, recommender systems
- 协议: CC BY-SA 4.0 所有者: Pranay Patil
- 链接: https://kaggle.com/datasets/itachi9604/disease-symptom-description-dataset
20. Diagnosis of COVID-19 and its clinical spectrum
- 星标数: ⭐ 495
- 简介: 人工智能与数据科学辅助临床决策(3月28日至4月3日)
- 主题: healthcare, public health, earth and nature, health, classification
- 协议: Unknown 所有者: Einstein Data4u
- 链接: https://kaggle.com/datasets/einsteindata4u/covid19
21. Student Stress Monitoring Datasets
- 星标数: ⭐ 488
- 简介: 压力、福祉因素、根本原因及其影响的综合关系
- 主题: mental health, healthcare, health, artificial intelligence, computer science
- 协议: Apache 2.0 所有者: Sultanul Ovi
- 链接: https://kaggle.com/datasets/mdsultanulislamovi/student-stress-monitoring-datasets
22. Indian Liver Patient Records
- 星标数: ⭐ 480
- 简介: 收集自印度安得拉邦东北部的患者记录
- 主题: healthcare, health, medicine, health conditions, cancer
- 协议: CC0: Public Domain 所有者: UCI Machine Learning
- 链接: https://kaggle.com/datasets/uciml/indian-liver-patient-records
23. Heart Attack Prediction
- 星标数: ⭐ 455
- 简介: 该文件描述了心脏病目录的内容。
- 主题: healthcare, health, health conditions, heart conditions, benchmark dataset
- 协议: CC0: Public Domain 所有者: Nikhil Anand
- 链接: https://kaggle.com/datasets/imnikhilanand/heart-attack-prediction
24. Polycystic ovary syndrome (PCOS)
- 星标数: ⭐ 436
- 简介: 多囊卵巢综合征数据集包含患者的所有生理和临床参数。
- 主题: research, diseases
- 协议: CC BY-NC-SA 4.0 所有者: prasoon kottarathil
- 链接: https://kaggle.com/datasets/prasoonkottarathil/polycystic-ovary-syndrome-pcos
25. Hospital Beds Management
- 星标数: ⭐ 404
- 简介: 医院模拟数据集研究:工作量、患者流量与床位容量分析
- 主题: healthcare, health
- 协议: CC0: Public Domain 所有者: Weiwei Zhu
- 链接: https://kaggle.com/datasets/jaderz/hospital-beds-management
26. Medicare Data
- 星标数: ⭐ 399
- 简介: 医疗保险数据(BigQuery数据集)
- 主题: healthcare, health, bigquery, drugs and medications
- 协议: CC0: Public Domain 所有者: Centers for Medicare & Medicaid Services
- 链接: https://kaggle.com/datasets/cms/cms-medicare
27. UCI Heart Disease Data
- 星标数: ⭐ 390
- 简介: 来自UCI数据仓库的心脏病数据集
- 主题: healthcare, health, medicine, feature engineering, tabular
- 协议: Data files © Original Authors 所有者: Redwan Sony
- 链接: https://kaggle.com/datasets/redwankarimsony/heart-disease-data
28. Pfizer Vaccine Tweets
- 星标数: ⭐ 390
- 简介: 辉瑞与BioNTech疫苗相关推文
- 主题: healthcare, public health, health, drugs and medications
- 协议: CC0: Public Domain 所有者: Gabriel Preda
- 链接: https://kaggle.com/datasets/gpreda/pfizer-vaccine-tweets
29. Breast Cancer Proteomes
- 星标数: ⭐ 374
- 简介: 将乳腺癌患者划分为不同的亚类
- 主题: healthcare, biology, chemistry, health, cancer
- 协议: Unknown 所有者: kajot
- 链接: https://kaggle.com/datasets/piotrgrabo/breastcancerproteomes
30. Disease Symptoms and Patient Profile Dataset
- 星标数: ⭐ 340
- 简介: 揭示患者与疾病之间错综复杂的关系,涵盖超过100种疾病。
- 主题: medicine, computer science, exploratory data analysis, classification, health conditions
- 协议: MIT 所有者: Laksika Tharmalingam
- 链接: https://kaggle.com/datasets/uom190346a/disease-symptoms-and-patient-profile-dataset
31. awesome-cancer-variant-resources
- 星标数: ⭐ 325
- 简介: 一个由社区维护的癌症临床知识库和数据库集合,专注于癌症变异研究。
- 主题: awesome-list, bioinformatics, cancer, cancer-genomics, cancer-variants
- 协议: MIT License 所有者: seandavi
- 链接: https://github.com/seandavi/awesome-cancer-variant-resources
32. HEALTHCARE PROVIDER FRAUD DETECTION ANALYSIS
- 星标数: ⭐ 322
- 简介: 医疗保健服务提供者欺诈检测分析
- 主题: healthcare, insurance
- 协议: CC0: Public Domain 所有者: Rohit Anand Gupta
- 链接: https://kaggle.com/datasets/rohitrox/healthcare-provider-fraud-detection-analysis
33. awesome-healthcare-ai
- 星标数: ⭐ 314
- 简介: 精选的优质开源医疗工具、算法、数据集及研究论文列表。
- 主题: awesome-list, awesome-lists, healthcare, healthcare-application, healthcare-datasets
- 协议: Creative Commons Zero v1.0 Universal 所有者: medtorch
- 链接: https://github.com/medtorch/awesome-healthcare-ai
34. AV : Healthcare Analytics
- 星标数: ⭐ 311
- 简介: 预测对车辆保险感兴趣的健康保险持有者
- 主题: healthcare, earth and nature, business, health, computer science
- 协议: Other (specified in description) 所有者: shivan kumar
- 链接: https://kaggle.com/datasets/shivan118/healthcare-analytics
35. AV : Healthcare Analytics II
- 星标数: ⭐ 306
- 简介: Analytics Vidhya 医疗分析黑客马拉松
- 主题: health, beginner, exploratory data analysis, multiclass classification
- 协议: Other (specified in description) 所有者: Neha Prabhavalkar
- 链接: https://kaggle.com/datasets/nehaprabhavalkar/av-healthcare-analytics-ii
36. Genetic Variant Classifications
- 星标数: ⭐ 303
- 简介: 预测某个变异是否会导致临床分类上的冲突。
- 主题: healthcare, genetics, earth and nature, biology, medicine
- 协议: CC0: Public Domain 所有者: Kevin Arvai
- 链接: https://kaggle.com/datasets/kevinarvai/clinvar-conflicting
37. Heart Attack Risk Prediction Dataset
- 星标数: ⭐ 292
- 简介: 利用多维度合成心脏病发作数据集解锁预测性洞察
- 主题: healthcare, public health, health, health conditions, heart conditions
- 协议: Other (specified in description) 所有者: Sourav Banerjee
- 链接: https://kaggle.com/datasets/iamsouravbanerjee/heart-attack-prediction-dataset
38. Lower Back Pain Symptoms Dataset
- 星标数: ⭐ 271
- 简介: 收集物理脊柱数据
- 主题: healthcare, health conditions
- 协议: Unknown 所有者: sammy123
- 链接: https://kaggle.com/datasets/sammy123/lower-back-pain-symptoms-dataset
39. Diabetes 130 US hospitals for years 1999-2008
- 星标数: ⭐ 256
- 简介: 糖尿病 – 再入院
- 主题: healthcare, health, diabetes
- 协议: CC0: Public Domain 所有者: Humberto Brandão, Ph.D.
- 链接: https://kaggle.com/datasets/brandao/diabetes
40. MIAS Mammography
- 星标数: ⭐ 256
- 简介: 寻找乳腺癌
- 主题: healthcare, health, health conditions, cancer
- 协议: Other (specified in description) 所有者: K Scott Mader
- 链接: https://kaggle.com/datasets/kmader/mias-mammography
41. Chronic illness: symptoms, treatments and triggers
- 星标数: ⭐ 235
- 简介: 治疗方法和环境压力如何影响症状表现?
- 主题: healthcare, diseases, health, medicine, health conditions
- 协议: CC BY-NC-SA 4.0 所有者: Flaredown
- 链接: https://kaggle.com/datasets/flaredown/flaredown-autoimmune-symptom-tracker
42. U.S. Healthcare Data
- 星标数: ⭐ 235
- 简介: 人口健康、疾病、药物、营养、健康计划
- 主题: united states, healthcare, diseases, nutrition, health
- 协议: CC0: Public Domain 所有者: BuryBuryZymon
- 链接: https://kaggle.com/datasets/maheshdadhich/us-healthcare-data
43. TSDB
- 星标数: ⭐ 233
- 简介: 一个Python工具箱仅需一行代码即可加载172个公开时间序列数据集,适用于机器学习和深度学习。这些数据集涵盖医疗健康、金融、电力、交通、天气等多个领域。
- 主题: classification, data-mining, database, deep-learning, forecasting
- 协议: BSD 3-Clause “New” or “Revised” License 所有者: WenjieDu
- 链接: https://github.com/WenjieDu/TSDB
44. Predict Diabetes
- 星标数: ⭐ 222
- 简介: 分析糖尿病数据库
- 主题: healthcare, health, exploratory data analysis, classification, diabetes
- 协议: CC0: Public Domain 所有者: Aman Chauhan
- 链接: https://kaggle.com/datasets/whenamancodes/predict-diabities
45. Hepatitis C Prediction Dataset
- 星标数: ⭐ 217
- 简介: 献血者与丙型肝炎患者的实验室检测值
- 主题: health and fitness, healthcare, cancer
- 协议: Database: Open Database, Contents: © Original Authors 所有者: fedesoriano
- 链接: https://kaggle.com/datasets/fedesoriano/hepatitis-c-dataset
46. Body Fat Prediction Dataset
- 星标数: ⭐ 212
- 简介: 252名男性的体脂估算与多项身体围度测量数据
- 主题: healthcare, public health, earth and nature, health, regression
- 协议: Data files © Original Authors 所有者: fedesoriano
- 链接: https://kaggle.com/datasets/fedesoriano/body-fat-prediction-dataset
47. american-healthcare-conundrum
- 星标数: ⭐ 211
- 简介: 调查性数据新闻:逐项量化美国医疗体系中的可避免浪费。基于CMS、OECD及联邦数据集的开源分析。目前已识别出986亿美元的可节约资金。
- 主题: cms-data, data-journalism, drug-pricing, health-policy, healthcare
- 协议: MIT License 所有者: rexrodeo
- 链接: https://github.com/rexrodeo/american-healthcare-conundrum
48. Covid-19 Case Surveillance Public Use Dataset
- 星标数: ⭐ 200
- 简介: 探索美国向疾病控制与预防中心报告的COVID-19病例人口统计趋势
- 主题: healthcare, public health, social science, tabular, covid19
- 协议: CC0: Public Domain 所有者: Möbius
- 链接: https://kaggle.com/datasets/arashnic/covid19-case-surveillance-public-use-dataset
49. COVID-19 patient pre-condition dataset
- 星标数: ⭐ 193
- 简介: 根据墨西哥政府数据集获得的数据
- 主题: health, logistic regression, covid19
- 协议: CC0: Public Domain 所有者: Tanmoy Mukherjee
- 链接: https://kaggle.com/datasets/tanmoyx/covid19-patient-precondition-dataset
50. Cirrhosis Prediction Dataset
- 星标数: ⭐ 187
- 简介: 预测肝硬化分期的18项临床特征
- 主题: healthcare, public health, health, multiclass classification, health conditions
- 协议: Data files © Original Authors 所有者: fedesoriano
- 链接: https://kaggle.com/datasets/fedesoriano/cirrhosis-prediction-dataset
51. U.S. Opiate Prescriptions/Overdoses
- 星标数: ⭐ 184
- 简介: 能否通过预测模型来拯救生命?
- 主题: healthcare, drugs and medications
- 协议: CC0: Public Domain 所有者: Alan “AJ” Pryor, Ph.D.
- 链接: https://kaggle.com/datasets/apryor6/us-opiate-prescriptions
52. Heart Attack Dataset
- 星标数: ⭐ 178
- 简介: 伊拉克埃尔比勒市Zheen医院
- 主题: healthcare, medicine, data visualization, data analytics, heart conditions
- 协议: Attribution 4.0 International (CC BY 4.0) 所有者: Fatemeh Mohammadinia
- 链接: https://kaggle.com/datasets/fatemehmohammadinia/heart-attack-dataset-tarik-a-rashid
53. COVID-19 – Clinical Data to assess diagnosis
- 星标数: ⭐ 177
- 简介: Data Intelligence Team提供的Sírio-Libanês人工智能与分析数据
- 主题: business, health, social science, medicine, classification
- 协议: Attribution-NonCommercial 4.0 International (CC BY-NC 4.0) 所有者: Hospital Sírio-Libanês
- 链接: https://kaggle.com/datasets/Sírio-Libanes/covid19
54. Cuff-Less Blood Pressure Estimation
- 星标数: ⭐ 177
- 简介: 用于无袖带血压估计的预处理和清洁生命信号
- 主题: healthcare, health, health conditions, heart conditions
- 协议: Unknown 所有者: Mohammad Kachuee
- 链接: https://kaggle.com/datasets/mkachuee/BloodPressureDataset
55. Healthcare Insurance
- 星标数: ⭐ 170
- 简介: 我的数据集涉及全球医疗保健领域的不安全感问题,目前正在开发中。
- 主题: exploratory data analysis, data visualization, neural networks, health conditions, numpy
- 协议: CC0: Public Domain 所有者: willian oliveira
- 链接: https://kaggle.com/datasets/willianoliveiragibin/healthcare-insurance
56. Cannabis Strains
- 星标数: ⭐ 169
- 简介: 大麻品种数据集
- 主题: healthcare, government, health
- 协议: Unknown 所有者: Liam Larsen
- 链接: https://kaggle.com/datasets/kingburrito666/cannabis-strains
57. Diabetes Health Indicators Dataset
- 星标数: ⭐ 165
- 简介: 用于糖尿病风险分析的10万份患者记录综合数据集
- 主题: healthcare, classification, binary classification, regression, health conditions
- 协议: CC0: Public Domain 所有者: Mohan Krishna Thalla
- 链接: https://kaggle.com/datasets/mohankrishnathalla/diabetes-health-indicators-dataset
58. Breast Cancer Gene Expression Profiles (METABRIC)
- 星标数: ⭐ 159
- 简介: 1904名患者的临床特征、mRNA水平Z分数及基因突变情况
- 主题: genetics, biology, health, cancer
- 协议: Database: Open Database, Contents: Database Contents 所有者: Raghad Alharbi
- 链接: https://kaggle.com/datasets/raghadalharbi/breast-cancer-gene-expression-profiles-metabric
59. Anxiety and Depression Psychological Therapies
- 星标数: ⭐ 159
- 简介: 国家焦虑与抑郁临床审计 – 英国
- 主题: mental health
- 协议: Other (specified in description) 所有者: Marília Prata
- 链接: https://kaggle.com/datasets/mpwolke/cusersmarildownloadsanxietycsv
60. Autism Screening
- 星标数: ⭐ 158
- 简介: 根据筛查结果对自闭症患者进行分类。
- 主题: universities and colleges, healthcare, education
- 协议: CC0: Public Domain 所有者: Faizunnabi
- 链接: https://kaggle.com/datasets/faizunnabi/autism-screening
61. COVID-19 Clinical Trials dataset
- 星标数: ⭐ 152
- 简介: 全球范围内正在进行的与COVID-19相关的临床研究数据库
- 主题: healthcare, covid19
- 协议: Database: Open Database, Contents: Database Contents 所有者: Parul Pandey
- 链接: https://kaggle.com/datasets/parulpandey/covid19-clinical-trials-dataset
62. Thyroid Disease Data
- 星标数: ⭐ 151
- 简介: 患者人口统计学资料及血液检测结果,以及甲状腺疾病诊断。
- 主题: health, medicine, classification, tabular, cancer
- 协议: Attribution 4.0 International (CC BY 4.0) 所有者: jaina
- 链接: https://kaggle.com/datasets/jainaru/thyroid-disease-data
63. clinical-trial-outcome-prediction
- 星标数: ⭐ 149
- 简介: 用于临床试验批准概率预测的基准数据集及深度学习方法(分层交互网络,HINT),发表于《细胞模式》2022年。
- 主题: benchmark, benchmark-datasets, clinical-data, clinical-research, clinical-research-data-warehouse
- 协议: 未提供 所有者: futianfan
- 链接: https://github.com/futianfan/clinical-trial-outcome-prediction
64. Employee Attrition for Healthcare
- 星标数: ⭐ 146
- 简介: 基于直观特征构建性能优异的机器学习模型。
- 主题: healthcare, people and society, health, classification
- 协议: CC0: Public Domain 所有者: JohnM
- 链接: https://kaggle.com/datasets/jpmiller/employee-attrition-for-healthcare
65. Hospital ratings
- 星标数: ⭐ 142
- 简介: Medicare.gov网站上用于医院质量比较的官方数据集
- 主题: public health, finance, health, hospitals and treatment centers
- 协议: CC0: Public Domain 所有者: Center for Medicare and Medicaid
- 链接: https://kaggle.com/datasets/center-for-medicare-and-medicaid/hospital-ratings
66. Hospitals and beds in India (Statewise)
- 星标数: ⭐ 142
- 简介: 印度各邦的床位和医院数量统计。
- 主题: india, health, hospitals and treatment centers, covid19
- 协议: CC0: Public Domain 所有者: Dheeraj M Pai
- 链接: https://kaggle.com/datasets/dheerajmpai/hospitals-and-beds-in-india
67. Cirrhosis Patient Survival Prediction
- 星标数: ⭐ 140
- 简介: 利用17项临床特征预测肝硬化患者的生存率
- 主题: healthcare, health, mortality, classification, binary classification
- 协议: Attribution 4.0 International (CC BY 4.0) 所有者: Joakim Arvidsson
- 链接: https://kaggle.com/datasets/joebeachcapital/cirrhosis-patient-survival-prediction
68. Global Hospital Beds Capacity (for covid-19)
- 星标数: ⭐ 137
- 简介: 了解全球典型医院床位容量的基准
- 主题: healthcare, health, social science, covid19
- 协议: CC0: Public Domain 所有者: Igor Kiulian
- 链接: https://kaggle.com/datasets/ikiulian/global-hospital-beds-capacity-for-covid19
69. heart failure clinical records
- 星标数: ⭐ 130
- 简介: 心力衰竭临床记录
- 主题: health, heart conditions
- 协议: Other (specified in description) 所有者: Nima Pourmoradi
- 链接: https://kaggle.com/datasets/nimapourmoradi/heart-failure-clinical-records
70. Thyroid Disease Data
- 星标数: ⭐ 130
- 简介: 患者人口统计学特征及甲状腺疾病诊断相关的血液检测结果
- 主题: medicine, exploratory data analysis, data cleaning, data visualization, classification
- 协议: CC0: Public Domain 所有者: Emmanuel F. Werr
- 链接: https://kaggle.com/datasets/emmanuelfwerr/thyroid-disease-data
71. Lung Cancer Detection
- 星标数: ⭐ 130
- 简介: 使用机器学习进行肺癌预测
- 主题: healthcare, public health, categorical, health, cancer
- 协议: CC0: Public Domain 所有者: Jillani SofTech
- 链接: https://kaggle.com/datasets/jillanisofttech/lung-cancer-detection
72. Adverse Food Events
- 星标数: ⭐ 128
- 简介: 90,000起与产品相关的用户报告不良医疗事件
- 主题: healthcare, government, medicine, software
- 协议: CC0: Public Domain 所有者: Food and Drug Administration
- 链接: https://kaggle.com/datasets/fda/adverse-food-events
73. cardiobot
- 星标数: ⭐ 127
- 简介: 心脏健康聊天机器人基于精心筛选的心血管疾病相关数据集进行训练。它能针对用户查询提供情境感知且医学相关的回答,帮助患者和医疗从业者理解症状、治疗方案及预防措施。该模型经过微调,确保其响应始终围绕心血管健康领域展开。
- 主题: cardio, chatbot, python
- 协议: MIT License 所有者: stellarloop
- 链接: https://github.com/stellarloop/cardiobot
74. Breast Cancer Diagnosis Dataset – Wisconsin State
- 星标数: ⭐ 122
- 简介: 分析肿瘤特征以进行癌症检测
- 主题: united states, beginner, intermediate, cancer
- 协议: Other (specified in description) 所有者: Saurabh Badole
- 链接: https://kaggle.com/datasets/saurabhbadole/breast-cancer-wisconsin-state
75. Obesity Classification Dataset
- 星标数: ⭐ 119
- 简介: 多分类数据集
- 主题: health and fitness, healthcare, public health, health, health conditions
- 协议: Attribution 4.0 International (CC BY 4.0) 所有者: Sujith K Mandala
- 链接: https://kaggle.com/datasets/sujithmandala/obesity-classification-dataset
76. AIDS Virus Infection Prediction 💉
- 星标数: ⭐ 119
- 简介: 对患者是否感染进行分类。
- 主题: healthcare, health, beginner, data visualization, classification
- 协议: CC0: Public Domain 所有者: Aadarsh velu
- 链接: https://kaggle.com/datasets/aadarshvelu/aids-virus-infection-prediction
77. awesome-healthcare-datasets
- 星标数: ⭐ 117
- 简介: 一份精选的公共领域优秀医疗数据集列表。
- 主题: 未提供
- 协议: MIT License 所有者: nickls
- 链接: https://github.com/nickls/awesome-healthcare-datasets
78. Lung Cancer Dataset
- 星标数: ⭐ 117
- 简介: 肺癌风险评估与分析详细患者档案
- 主题: healthcare, computer science, health conditions, cancer
- 协议: Database: Open Database, Contents: © Original Authors 所有者: Akash Nath
- 链接: https://kaggle.com/datasets/akashnath29/lung-cancer-dataset
79. awesome-healthcare-datasets
- 星标数: ⭐ 116
- 简介: 医疗保健与生物医学数据集,用于人工智能/机器学习
- 主题: awesome-list, biomedical, clinical, datasets, healthcare
- 协议: Creative Commons Zero v1.0 Universal 所有者: geniusrise
- 链接: https://github.com/geniusrise/awesome-healthcare-datasets
80. COVID19 Daily Updates
- 星标数: ⭐ 116
- 简介: Daily updates of Coronavirus 2019-nCoV (a.k.a. COVID-19)
- 主题: healthcare, public health, news
- 协议: Data files © Original Authors 所有者: Gabriel Preda
- 链接: https://kaggle.com/datasets/gpreda/coronavirus-2019ncov
81. Pathogen Detection | Salmonella Enterica
- 星标数: ⭐ 116
- 简介: 病原体检测在疾病诊断中具有重要意义。
- 主题: genetics, biology
- 协议: Other (specified in description) 所有者: Mohamadreza Momeni
- 链接: https://kaggle.com/datasets/imtkaggleteam/pathogen-detection-salmonella-enterica
82. Real Breast Cancer Data
- 星标数: ⭐ 115
- 简介: 真实乳腺癌样本数据集,适用于医疗健康与癌症数据分析。
- 主题: diseases, categorical, health, tabular, cancer
- 协议: CC0: Public Domain 所有者: AM
- 链接: https://kaggle.com/datasets/amandam1/breastcancerdataset
83. Cancer Risk Factors Data
- 星标数: ⭐ 114
- 简介: 关联生活方式、环境与遗传因素的癌症风险数据集。
- 主题: healthcare, diseases, deep learning, cancer, pytorch
- 协议: Attribution 4.0 International (CC BY 4.0) 所有者: Tarek Masryo
- 链接: https://kaggle.com/datasets/tarekmasryo/cancer-risk-factors-dataset
84. Healthcare Diabetes Dataset
- 星标数: ⭐ 113
- 简介: 糖尿病风险评估综合数据集
- 主题: healthcare, exploratory data analysis, binary classification, regression, diabetes
- 协议: Apache 2.0 所有者: Nandita Pore
- 链接: https://kaggle.com/datasets/nanditapore/healthcare-diabetes
85. Medical Insurance Cost Prediction
- 星标数: ⭐ 111
- 简介: 10万名个体的健康、生活方式、保险、理赔及医疗费用数据
- 主题: healthcare, health, classification, regression, insurance
- 协议: CC0: Public Domain 所有者: Mohan Krishna Thalla
- 链接: https://kaggle.com/datasets/mohankrishnathalla/medical-insurance-cost-prediction
86. Coronavirus Records Dataset: 2021
- 星标数: ⭐ 111
- 简介: 疫情分析:2021年全球冠状病毒记录数据集(按大洲划分)
- 主题: healthcare, earth and nature, health, medicine, beginner
- 协议: Other (specified in description) 所有者: Sourav Banerjee
- 链接: https://kaggle.com/datasets/iamsouravbanerjee/covid19-dataset-world-and-continent-wise
87. Predict survival of patients with heart failure
- 星标数: ⭐ 110
- 简介: 心力衰竭临床记录
- 主题: health, classification, clustering, regression, health conditions
- 协议: Attribution 4.0 International (CC BY 4.0) 所有者: Rabie El Kharoua
- 链接: https://kaggle.com/datasets/rabieelkharoua/predict-survival-of-patients-with-heart-failure
88. SAS-Clinical-Trials-Toolkit
- 星标数: ⭐ 109
- 简介: 临床试验应用中的SAS脚本,包括生成SDTM域、ADaM数据集以及Define.xml文件。
- 主题: 未提供
- 协议: GNU General Public License v3.0 所有者: wyp1125
- 链接: https://github.com/wyp1125/SAS-Clinical-Trials-Toolkit
89. ChEMBL EBI Small Molecules Database
- 星标数: ⭐ 109
- 简介: 用于药物发现的大规模生物活性数据库(BigQuery)
- 主题: healthcare, earth and nature, biology, chemistry, business
- 协议: CC BY-SA 4.0 所有者: Google BigQuery
- 链接: https://kaggle.com/datasets/bigquery/ebi-chembl
90. HeartHealthPrediction
- 星标数: ⭐ 107
- 简介: 在全球范围内,无论是发达国家还是欠发达国家,心脏病都是导致死亡的主要原因。数据科学家利用独特的机器学习技术,通过真实数据集高效且准确地对健康疾病进行建模。医疗分析师迫切需要能够预测患者发病前疾病风险的模型或系统。高胆固醇、不健康饮食、有害饮酒、高血糖、高血压以及吸烟是心脏病发病风险的主要征兆……
- 主题: data-science, decision-trees, healthcare, heart-health-prediction, meachinelearning
- 协议: 未提供 所有者: ammarmahmood1999
- 链接: https://github.com/ammarmahmood1999/HeartHealthPrediction
91. Global Health,Mortality & Disease Trend Since 2000
- 星标数: ⭐ 106
- 简介: 自2000年起各国健康、死亡率及人口指标
- 主题: healthcare, diseases, health, data analytics
- 协议: CC BY-SA 4.0 所有者: Shreyansh Dangi
- 链接: https://kaggle.com/datasets/shreyanshdangi/global-health-mortality-and-population-since-2000
92. Health Care Analytics
- 星标数: ⭐ 105
- 简介: 预测患者预后
- 主题: healthcare, health, data cleaning, ensembling, regression
- 协议: Data files © Original Authors 所有者: Abishek Sudarshan
- 链接: https://kaggle.com/datasets/abisheksudarshan/health-care-analytics
93. Diabetes_Dataset_With_18_Features
- 星标数: ⭐ 104
- 简介: 您可以使用此数据集构建糖尿病诊断模型。
- 主题: categorical, feature engineering, gradient boosting, binary classification, diabetes
- 协议: Other (specified in description) 所有者: Parisa Karimi Darabi
- 链接: https://kaggle.com/datasets/pkdarabi/diabetes-dataset-with-18-features
94. Clinical Dataset
- 星标数: ⭐ 102
- 简介: 发现队列和验证队列的临床数据
- 主题: medicine, drugs and medications, hospitals and treatment centers
- 协议: Other (specified in description) 所有者: Mohamadreza Momeni
- 链接: https://kaggle.com/datasets/imtkaggleteam/clinical-dataset
95. Laryngeal Voice Disorder Classification
- 星标数: ⭐ 100
- 简介: 喉部嗓音障碍数据集
- 主题: health, beginner, intermediate, health conditions, hospitals and treatment centers
- 协议: Other (specified in description) 所有者: Daniil Krasnoproshin
- 链接: https://kaggle.com/datasets/daniilkrasnoproshin/healthy-vs-laryngeal-disorder-classification