食品汙染之網路媒體即時分類系統 - 以三聚氰胺為例

2
食食食食食食食食 食食食食食食 - 食食食食食食食 食食食食 食食食食食食食食食食食食食食食 食食食食食食食食食食食食食食食食 食食食食食 食食食食 食食食食食食食食食食食食食食食食食食食 ,,一。 食食食食食 食食食食食食食食食食食食食食食食食食食 食食食食食 食食食食食食食食食食食 食食食食食食食食食食食食食食 食食食食 ,,一,, 食食食食食食食食 2008 食食食食食食食食食食食食食食)。 食食 食食食食食食食食食食食食食食食食食食食食食食食食 食食食食食食食食食食食食 ,, Google News 食食食食食食食食食食食食 食食食食食食食食食食食食食食食食食食食食食 ( 食食食食食食食食食食食食食食 ) 食食食食 食食食食食食食食食食食 食食食食 ,一, 食食食食食食食食食食食 食食食食食食食食 C/C++ 食食食食食食食食食食食食食 (Latent Dirichlet Allocation, LDA) 食 食食 Gibbs LDA++ 食食食食食 食食食食食食食食食食食食食食食食 ,( Unsupervised Learning 食 食食食 食食 Supervised Learning 食食食食 食食食食食食食食食食食食食食食食食食食食食食 食食食食食食食食食食食 ),。 食食食食食食食食食食食食食食食食食食食食食食食食食食食食食食食食食食 「、、、、」 521 食食食食食食 793 食食食食食食食食食食食食食食食食食食食食食食食食食食食食食 (precision) 食食食食 (recall) 食食食食食食食食食食食食食食 F-measur e 食食食食食食食 食食食食食食食食食食食食食食食食食食食食食食 69.66% 食食食食食食 64.52% 食食食 F-meas ure 食 0.68 食食食食食食食食食食食食食食食食食食食食食食 食食食食食食食食食食食食食食食食食食食食食食食食 食食食食食食食食食 食食食食食食 食食食食食食食食食食食食食食

Upload: jolie-farley

Post on 01-Jan-2016

19 views

Category:

Documents


2 download

DESCRIPTION

食品汙染之網路媒體即時分類系統 - 以三聚氰胺為例. 中文摘要 食品安全對於人民健康的影響甚大,因此保障人民食品安全提升其品質,是重要的一項議題。隨著國際間食品及食品原料進出口貿易的往來日漸頻繁,世界各地食品及原料的流動變得無遠弗屆,所以只要一有食品及原料遭受汙然,隨時都有可能流通到世界各地,導致社會大眾的不安與恐慌(如 2008 年爆發的三聚氰胺食品污染事件)。 - PowerPoint PPT Presentation

TRANSCRIPT

  • -2008Google News()C/C++(Latent Dirichlet Allocation, LDA)Gibbs LDA++Unsupervised LearningSupervised Learning521793(precision)(recall)F-measure69.66%64.52%F-measure0.68

  • A Web-based Online Document Classification System for Food contamination News Take Melamine as the ExampleFood safety is essential to human health, and also to guard the food safety as well as the quality of food is an important issue. As long as the international food and food materials trade being more frequently, the flow of them are far-reaching, and will spread all over the world once they get contaminated. Turns out, it will bring unrest and panic to the public (such as the 2008 outbreak of melamine food contamination).This study is to build an online food contamination monitoring and automatic classification system to regularly search Google news about food contamination (we took melamine-related news as an example) in Taiwan and obtain the most complete information and stored them in the database. The system will classify news into correct categories and can help users find relevant information. In this study, we used Gibbs LDA++, which is a C/C++ implementation of Latent Dirichlet Allocation (LDA) to train news documents by unsupervised learning and supervised learning. The classifier was built by the parameter estimations and inferences from LDA training results and then adjusted manually by human expert. We defined the melamine news as five categories including "contamination", "analysis", "medical and health", "law and policy" and others. 521 news documents were used as training data to train the classifier and 793 documents were used to test and evaluate the classifier. The assessment of the effectiveness for the classifier is based on precision, recall and F-measure. According to the evaluation for the classifier, the macro-precision is 69.66%, the macro-recall is 64.52% and the F-measure is 0.68.According to the evaluation results, we estimate the performance of classification system and will improve the system in the future research. We expect the system could save time for reading complexity news, and help people get prepared for food contamination.