TY - Jour A2 - Ali,Shaukat Au - Alam Khan,Zahid Au - Feng,Zhidyong Au - uddin,M. Irfan Au - Mast,Noor Au - Ali Shah,Syed Atif Au - Imtiaz,Muhammad Au - Al Khasawneh,MahmadAhmad Au - Mahmoud,Marwan Py - 2020年DA - 2020/11/28 Ti - 使用加固学习疾病预防最佳政策学习SP - 7627290 VL - 2020 AB - 疾病对人口的生活质量产生巨大影响。人类一直在寻求寻找策略,以避免危及生命或影响人类的生活质量的疾病。有效利用人类可用于控制不同疾病的资源一直至关重要。研究人员最近更有兴趣找到基于AI的解决方案来控制来自疾病的人群,由于深度学习的压倒性受欢迎。有许多监督技术始终用于疾病诊断。然而,基于监督的解决方案的主要问题是数据的可用性,这并不总是可以完整的。例如,我们没有足够的数据,显示人类和不同环境州的不同状态,以及人类或病毒所采取的所有不同行动最终导致了最终患者的生命。因此,需要查找基于无监督的解决方案或某些技术,这些解决方案不具有对底层数据集的依赖性。在本文中,我们探索了加强学习方法。 We have tried different reinforcement learning algorithms to research different solutions for the prevention of diseases in the simulation of the human population. We have explored different techniques for controlling the transmission of diseases and its effects on health in the human population simulated in an environment. Our algorithms have found out policies that are best for the human population to protect themselves from the transmission and infection of malaria. The paper concludes that deep learning-based algorithms such as Deep Deterministic Policy Gradient (DDPG) have outperformed traditional algorithms such as Q-Learning or SARSA. SN - 1058-9244 UR - https://doi.org/10.1155/2020/7627290 DO - 10.1155/2020/7627290 JF - Scientific Programming PB - Hindawi KW - ER -