A Tag Mining framework for Disease Inference from Health related data
Abstract
Normally people use Google to search their queries and that search engine respond them with the answer but that answer is in scattered format. User not gets exact answer for his / her queries. So we are going to implement this paper, we first report a user study on the information needs of health seekers in terms of questions and then select those that ask for possible diseases of their manifested symptoms for further analytic. We next propose a learning scheme to finding the possible diseases given the questions of health seekers. The proposed scheme comprises of two key components. The first globally mines the discriminate medical signatures from raw features. The second deems the raw features and their signatures as input nodes in one layer and hidden nodes in the subsequent layer, respectively. Meanwhile, it learns the inter-relations between these two layers via pre-training with pseudo labeled data. Following that, the hidden nodes serve as raw features for the more abstract signature mining. With incremental and alternative repeating of these two components, our scheme builds a sparsely connected deep architecture with three hidden layers. Overall, it well fits specific tasks with fine-tuning. Extensive experiments on a realworld dataset labeled by online doctors show the significant performance gains of our scheme.
References
[9] “Online health research eclipsing patient-doctor conversations,” Makovsky Health and Kelton, Survey, 2013. [10] L. Nie, M. Wang, Z. Zha, G. Li, and T.-S. Chua, “Multimedia answering: Enriching text qa with media information,” in Proceedings of the International ACM SIGIR Conference, 2011. [11] L. Nie, Y.-L. Zhao, X. Wang, J. Shen, and T.-S. Chua, “Learning to recommend descriptive tags for questions in social forums,”ACM Transactions on Information System, 2014. [12] P. Sondhi, J. Sun, H. Tong, and C. Zhai, “Sympgraph: A framework for mining clinical notes through symptom relation graphs,” in The ACM SIGKDD Conference on Knowledge Discovery and Data Mining, 2012. [13] L. Nie, M. Wang, Y. Gao, Z.-J. Zha and T.-S. Chua, “Beyond text qa: Multimedia answer generation by harvesting web information,” Multimedia, IEEE Transactions on, 2013. [14] D. Zhu and B. Carterette, “An adaptive evidence weighting method for medical record search,” in Proceedings of the International ACM SIGIR Conference, 2013. [15] R. Fakoor, F. Ladhak, A. Nazi, and M. Huber, “Using deep learning to enhance cancer diagnosis and classification,” in Proceedings of the International Conference on Machine Learning, 2013. [16] L. Nie, Y.-L. Zhao, M. Akbari, J. Shen, and T.-S. Chua, “Bridging the vocabulary gap between health seekers and healthcare knowledge,” IEEE Transactions on Knowledge and Data Engineering, 2014. [17] D. Erhan, Y. Bengio, A. Courville, P.-A. Manzagol, P. Vincent, and S. Bengio, “Why does unsupervised pretraining help deep learning?” Journal of Machine Learning Research, 2010. [18] Y. Zhang and B. Liu, “Semantic text classification of disease reporting,” in Proceedings of the International ACM SIGIR Conference, 2007. [19] T. C. Zhou, M. R. Lyu, and I. King, “A classification based approach to question routing in community question answering,” in The International World Wide Web Conference, 2012. [20] R. W. White and E. Horvitz, “Studies of the onset and persistence of medical concerns in search logs,” in Proceedings of the International ACM SIGIR Conference, 2012. [21] M.-A. Cartright, R. W. White, and E. Horvitz, “Intentions and attention in exploratory health search,” in Proceedings of the International ACM SIGIR Conference, 2011. [22] B. Koopman, P. Bruza, L. Sitbon, and M. Lawley, “Evaluating medical information retrieval,” in Proceedings of the International ACM SIGIR Conference, 2011. [23] C. B. Akgu¨ l, D. U¨ nay, and A. Ekin, “Automated diagnosis of alzheimer’s disease using image similarity and user feedback,” in Proceedings of the ACM International Conference on Image and Video Retrieval, 2009. [24] S. Doan and H. Xu, “Recognizing medication related entities in hospital discharge summaries using support vector machine,” in Proceedings of the International Conference on Computational Linguistics, 2010. [25] J. Zhou, L. Yuan, J. Liu, and J. Ye, “A multi-task learning formulation for predicting disease progression,” in The ACM SIGKDD Conference on Knowledge Discovery and Data Mining, 2011. [26] K. I. Penny and I. Atkinson, “Approaches for dealing with missing data in health care studies,” Journal of Clinical Nursing, 2012.
[27] H. Liu, L. Latecki, and S. Yan, “Fast detection of dense sub graphs with iterative shrinking and expansion,” IEEE Transactions on Pattern Analysis and Machine Intelligence, 2013. [28] S. Ghumbre, C. Patil, and A. Ghatol, “Heart disease diagnosis using support vector machine,” in Proceedings of the International Conference on Computer Science and Information Technology, 2011.
To ensure uniformity of treatment among all contributors, other forms may not be substituted for this form, nor may any wording of the form be changed. This form is intended for original material submitted to AJCT and must accompany any such material in order to be published by AJCT. Please read the form carefully.
The undersigned hereby assigns to the Asian Journal of Convergence in Technology Issues ("AJCT") all rights under copyright that may exist in and to the above Work, any revised or expanded derivative works submitted to AJCT by the undersigned based on the Work, and any associated written, audio and/or visual presentations or other enhancements accompanying the Work. The undersigned hereby warrants that the Work is original and that he/she is the author of the Work; to the extent the Work incorporates text passages, figures, data or other material from the works of others, the undersigned has obtained any necessary permission. See Retained Rights, below.
AUTHOR RESPONSIBILITIES
AJCT distributes its technical publications throughout the world and wants to ensure that the material submitted to its publications is properly available to the readership of those publications. Authors must ensure that The Work is their own and is original. It is the responsibility of the authors, not AJCT, to determine whether disclosure of their material requires the prior consent of other parties and, if so, to obtain it.
RETAINED RIGHTS/TERMS AND CONDITIONS
1. Authors/employers retain all proprietary rights in any process, procedure, or article of manufacture described in the Work.
2. Authors/employers may reproduce or authorize others to reproduce The Work and for the author's personal use or for company or organizational use, provided that the source and any AJCT copyright notice are indicated, the copies are not used in any way that implies AJCT endorsement of a product or service of any employer, and the copies themselves are not offered for sale.
3. Authors/employers may make limited distribution of all or portions of the Work prior to publication if they inform AJCT in advance of the nature and extent of such limited distribution.
4. For all uses not covered by items 2 and 3, authors/employers must request permission from AJCT.
5. Although authors are permitted to re-use all or portions of the Work in other works, this does not include granting third-party requests for reprinting, republishing, or other types of re-use.
INFORMATION FOR AUTHORS
AJCT Copyright Ownership
It is the formal policy of AJCT to own the copyrights to all copyrightable material in its technical publications and to the individual contributions contained therein, in order to protect the interests of AJCT, its authors and their employers, and, at the same time, to facilitate the appropriate re-use of this material by others.
Author/Employer Rights
If you are employed and prepared the Work on a subject within the scope of your employment, the copyright in the Work belongs to your employer as a work-for-hire. In that case, AJCT assumes that when you sign this Form, you are authorized to do so by your employer and that your employer has consented to the transfer of copyright, to the representation and warranty of publication rights, and to all other terms and conditions of this Form. If such authorization and consent has not been given to you, an authorized representative of your employer should sign this Form as the Author.
Reprint/Republication Policy
AJCT requires that the consent of the first-named author and employer be sought as a condition to granting reprint or republication rights to others or for permitting use of a Work for promotion or marketing purposes.
GENERAL TERMS
1. The undersigned represents that he/she has the power and authority to make and execute this assignment.
2. The undersigned agrees to indemnify and hold harmless AJCT from any damage or expense that may arise in the event of a breach of any of the warranties set forth above.
3. In the event the above work is accepted and published by AJCT and consequently withdrawn by the author(s), the foregoing copyright transfer shall become null and void and all materials embodying the Work submitted to AJCT will be destroyed.
4. For jointly authored Works, all joint authors should sign, or one of the authors should sign as authorized agent
for the others.
Licenced by :
Creative Commons Attribution 4.0 International License.