A Tag Mining framework for Disease Inference from Health related data

  • Ms. Vidhi L Chawda university of pune
  • Vishwanath S Mahalle
Keywords: Hidden layers, Community-based Health Services, Question Answering, Disease Inference and Deep Learning

Abstract

Normally people use Google to search their queries and that search engine respond them with the answer but that answer is in scattered format. User not gets exact answer for his / her queries. So we are going to implement this paper, we first report a user study on the information needs of health seekers in terms of questions and then select those that ask for possible diseases of their manifested symptoms for further analytic. We next propose a learning scheme to finding the possible diseases given the questions of health seekers. The proposed scheme comprises of two key components. The first globally mines the discriminate medical signatures from raw features. The second deems the raw features and their signatures as input nodes in one layer and hidden nodes in the subsequent layer, respectively. Meanwhile, it learns the inter-relations between these two layers via pre-training with pseudo labeled data. Following that, the hidden nodes serve as raw features for the more abstract signature mining. With incremental and alternative repeating of these two components, our scheme builds a sparsely connected deep architecture with three hidden layers. Overall, it well fits specific tasks with fine-tuning. Extensive experiments on a realworld dataset labeled by online doctors show the significant performance gains of our scheme.

References

[1] Liqiang Nie, Meng Wang, Luming Zhang, Shuicheng Yan, Member, IEEE, Bo Zhang, Senior Member, IEEE, TatSeng Chua, Senior Member, IEEE ”Disease Inference from Health-Related Questions via Sparse Deep Learning” IEEE Transactions on Knowledge and Data Engineering ,May 2014. [2] F. Wang, N. Lee, J. Hu, J. Sun, S. Ebadollah , and A. Laine, “A framework for mining signatures from event sequences and its applications in healthcare data,” IEEE Transactions on Pattern Analysis and Machine Intelligence, 2013. [3] F. Wang, N. Lee, J. Hu, J. Sun, and S. Ebadollahi, “Towards heterogeneous temporal clinical event pattern discovery: A convolutional approach,” in The ACM SIGKDD Conference on Knowledge Discovery and Data Mining, 2012. [4] David Barbella1, Sami Benzaid2, Janara Christensen3, Bret Jackson4, X. Victor Qin “Understanding Support Vector Machine Classifications via a Recommender System-Like Approach” in Proceedings of the IJSR Conference, 2013. [5] Lejun Gong∗, Ronggen Yang, Qin Yan, and Xiao Sun, “Prioritization of Disease Susceptibility Genes Using LSM/SVD” in Proceedings of the IJSR Conference, 2011 [6] M.Shouman, T. Turner, and R. Stocker, “Using decision tree for diagnosing heart disease patients,” in Proceedings of the Australasian Data Mining Conference, 2011. [7] Y. Bengio, A. Courville, and P. Vincent, “Representation learning: A review and new perspectives,” IEEE Transactions on Pattern Analysis and Machine Intelligence, 2013. [8] Refinement of the Facility-Level Medical Technology Score to Reflect Key Disease Response Capacity and Personnel Availability, Olumurejiwa A. Fatunde (Student Member, IEEE)1, And Timothy W. KOTIN (Student Member, IEEE) 2012.
[9] “Online health research eclipsing patient-doctor conversations,” Makovsky Health and Kelton, Survey, 2013. [10] L. Nie, M. Wang, Z. Zha, G. Li, and T.-S. Chua, “Multimedia answering: Enriching text qa with media information,” in Proceedings of the International ACM SIGIR Conference, 2011. [11] L. Nie, Y.-L. Zhao, X. Wang, J. Shen, and T.-S. Chua, “Learning to recommend descriptive tags for questions in social forums,”ACM Transactions on Information System, 2014. [12] P. Sondhi, J. Sun, H. Tong, and C. Zhai, “Sympgraph: A framework for mining clinical notes through symptom relation graphs,” in The ACM SIGKDD Conference on Knowledge Discovery and Data Mining, 2012. [13] L. Nie, M. Wang, Y. Gao, Z.-J. Zha and T.-S. Chua, “Beyond text qa: Multimedia answer generation by harvesting web information,” Multimedia, IEEE Transactions on, 2013. [14] D. Zhu and B. Carterette, “An adaptive evidence weighting method for medical record search,” in Proceedings of the International ACM SIGIR Conference, 2013. [15] R. Fakoor, F. Ladhak, A. Nazi, and M. Huber, “Using deep learning to enhance cancer diagnosis and classification,” in Proceedings of the International Conference on Machine Learning, 2013. [16] L. Nie, Y.-L. Zhao, M. Akbari, J. Shen, and T.-S. Chua, “Bridging the vocabulary gap between health seekers and healthcare knowledge,” IEEE Transactions on Knowledge and Data Engineering, 2014. [17] D. Erhan, Y. Bengio, A. Courville, P.-A. Manzagol, P. Vincent, and S. Bengio, “Why does unsupervised pretraining help deep learning?” Journal of Machine Learning Research, 2010. [18] Y. Zhang and B. Liu, “Semantic text classification of disease reporting,” in Proceedings of the International ACM SIGIR Conference, 2007. [19] T. C. Zhou, M. R. Lyu, and I. King, “A classification based approach to question routing in community question answering,” in The International World Wide Web Conference, 2012. [20] R. W. White and E. Horvitz, “Studies of the onset and persistence of medical concerns in search logs,” in Proceedings of the International ACM SIGIR Conference, 2012. [21] M.-A. Cartright, R. W. White, and E. Horvitz, “Intentions and attention in exploratory health search,” in Proceedings of the International ACM SIGIR Conference, 2011. [22] B. Koopman, P. Bruza, L. Sitbon, and M. Lawley, “Evaluating medical information retrieval,” in Proceedings of the International ACM SIGIR Conference, 2011. [23] C. B. Akgu¨ l, D. U¨ nay, and A. Ekin, “Automated diagnosis of alzheimer’s disease using image similarity and user feedback,” in Proceedings of the ACM International Conference on Image and Video Retrieval, 2009. [24] S. Doan and H. Xu, “Recognizing medication related entities in hospital discharge summaries using support vector machine,” in Proceedings of the International Conference on Computational Linguistics, 2010. [25] J. Zhou, L. Yuan, J. Liu, and J. Ye, “A multi-task learning formulation for predicting disease progression,” in The ACM SIGKDD Conference on Knowledge Discovery and Data Mining, 2011. [26] K. I. Penny and I. Atkinson, “Approaches for dealing with missing data in health care studies,” Journal of Clinical Nursing, 2012.
[27] H. Liu, L. Latecki, and S. Yan, “Fast detection of dense sub graphs with iterative shrinking and expansion,” IEEE Transactions on Pattern Analysis and Machine Intelligence, 2013. [28] S. Ghumbre, C. Patil, and A. Ghatol, “Heart disease diagnosis using support vector machine,” in Proceedings of the International Conference on Computer Science and Information Technology, 2011.
Published
2018-03-20
How to Cite
Chawda, M. V., & Mahalle, V. (2018). A Tag Mining framework for Disease Inference from Health related data. Asian Journal For Convergence In Technology (AJCT) ISSN -2350-1146, 3(3). Retrieved from http://www.asianssr.org/index.php/ajct/article/view/227
Section
Article

Most read articles by the same author(s)

Obs.: This plugin requires at least one statistics/report plugin to be enabled. If your statistics plugins provide more than one metric then please also select a main metric on the admin's site settings page and/or on the journal manager's settings pages.