Melita

The Semantic Web is dependent on machine-readable content, which is extremely laborious to produce. There is a growing need for automated document annotation.

What's the Problem?

* Machine-readable content is needed for the Semantic Web.
* Most actual or potential users of the Semantic Web are not experts in document annotation.
* Manual annotation is difficult, slow, tedious and costly.
* There is a growing need for automated support for document annotation.
* Current Information Extraction (IE) technologies require skilled human effort for annotation.
* Many users are knowledgeable about their domain but have limited knowledge of computing and natural language processing.

Towards a Solution

Melita is a semi-automatic annotation tool with an integrated Adaptive Information Extraction engine ([1]Amilcare). Its purpose is to support the user in the process of annotation. Melita aims to gradually change the role of the user from annotator to supervisor. The system is pro-active in the sense that it takes the initiative to perform any pre-processing that will be needed later. The novelty of Melita is that the Adaptive Information Extraction system can be tuned to provide the desired level of pro-activity and intrusiveness.
This tuning is done by adjusting two slide bars that alter precision and recall, without the user needing to understand these concepts or to have any knowledge of natural language processing.

Melita also contains a document sorting mechanism that dynamically re-sorts the documents after every annotation in order to find the document that best covers the unexplored areas of the domain. Documents with the fewest tags are taken to cover unexplored areas of the domain, where new rules can be learned once they are annotated. This approach has led to quicker convergence of the learning algorithm whilst overcoming the problem of data sparseness.

Take a Guided Tour

* A General Introduction video, in Shockwave [2]Flash (0.5 MB).
* A Detailed Tutorial video, in Shockwave [3]Flash (1.5 MB).

Obtaining the Technology

Please contact the developers, [4]Alexiei Dingli and [5]Fabio Ciravegna.

Technical requirements: Melita is a client-server system, so the client requires very low technical specifications. The server requires the same specifications as Amilcare: Windows 2000 or XP, Java Runtime Environment 1.3, 512 MB RAM and an 800 MHz processor.

Example Applications

* [6]MnM
* [7]FASiL
* [8]VOX generation

Further Reading

[9]Fabio Ciravegna, [10]Alexiei Dingli, [11]Daniela Petrelli and [12]Yorick Wilks: "[13]User-System Cooperation in Document Annotation based on Information Extraction" in [14]13th International Conference on Knowledge Engineering and Knowledge Management (EKAW02), 1-4 October 2002, Sigüenza (Spain). Available in the eprints [15]archive.

[16]Fabio Ciravegna, [17]Alexiei Dingli, [18]Daniela Petrelli and [19]Yorick Wilks: "[20]Timely and Non-Intrusive Active Document Annotation via Adaptive Information Extraction" in [21]Semantic Authoring, Annotation & Knowledge Markup (SAAKM 2002), [22]ECAI 2002 Workshop, July 22-26, 2002, Lyon, France.

[23]Alexiei Dingli: "[24]Next Generation Annotation Interfaces for Adaptive Information Extraction" in [25]6th Annual Computer Linguists UK Colloquium (CLUK03), January 6-7, 2003, Edinburgh, UK.

[26]Fabio Ciravegna, [27]Alexiei Dingli, [28]Yorick Wilks and [29]Daniela Petrelli: "Using Adaptive Information Extraction for Effective Human-centred Document Annotation" in R. Skuppin (ed.): Text Mining (preliminary title), book published by Springer Verlag, to appear in 2003.

Posters:

* [30]Fabio Ciravegna, [31]Alexiei Dingli and [32]Daniela Petrelli: "[33]Active Document Enrichment using Adaptive Information Extraction from Text" in [34]1st International Semantic Web Conference (ISWC2002), June 9-12, 2002, Sardinia, Italy [ [35]View Poster ]. Available in the eprints [36]archive.
* [37]Fabio Ciravegna, [38]Alexiei Dingli and [39]Daniela Petrelli: "[40]Document Annotation via Adaptive Information Extraction" in [41]The 25th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, August 11-15, 2002, Tampere, Finland [ [42]View Poster ]
* [43]Alexiei Dingli: "Active Document Annotation via Adaptive Information Extraction" in University of Sheffield, Department of Computer Science, Research Retreat, 6th November 2002, Sheffield [ [44]View Poster ]

References

1. http://www.aktors.org/technologies/amilcare/
2. file://localhost/home/www.aktors/htdocs/dynamic-technology-pages/melita/melshort3.html
3. file://localhost/home/www.aktors/htdocs/dynamic-technology-pages/melita/mel4.html
4. mailto:A.Dingli@dcs.shef.ac.uk
5. mailto:F.Ciravegna@dcs.shef.ac.uk
6. http://kmi.open.ac.uk/projects/akt/MnM/
7. http://www.fasil.co.uk/
8. http://www.voxgeneration.com/
9. mailto:F.Ciravegna@dcs.shef.ac.uk
10. mailto:alexiei@dcs.shef.ac.uk
11. mailto:d.petrelli@sheffield.ac.uk
12. mailto:yorick@dcs.shef.ac.uk
13. http://www.dcs.shef.ac.uk/%7Ealexiei/Documents/Papers/ekaw2002.pdf
14. http://babage.dia.fi.upm.es/ekaw02/ekaw02.htm
15. http://eprints.aktors.org/archive/00000123/
16. mailto:F.Ciravegna@dcs.shef.ac.uk
17. mailto:alexiei@dcs.shef.ac.uk
18. mailto:d.petrelli@sheffield.ac.uk
19. mailto:yorick@dcs.shef.ac.uk
20. http://www.dcs.shef.ac.uk/%7Ealexiei/Documents/Papers/saakm2002.pdf
21. http://saakm2002.aifb.uni-karlsruhe.de/index.html
22. http://ecai2002.univ-lyon1.fr/
23. mailto:alexiei@dcs.shef.ac.uk
24. http://www.dcs.shef.ac.uk/%7Ealexiei/Documents/Posters/cluk2003.pdf
25. http://www.iccs.informatics.ed.ac.uk/%7Estephenc/cluk/cluk6.html
26. mailto:F.Ciravegna@dcs.shef.ac.uk
27. mailto:alexiei@dcs.shef.ac.uk
28. mailto:yorick@dcs.shef.ac.uk
29. mailto:d.petrelli@sheffield.ac.uk
30. mailto:F.Ciravegna@dcs.shef.ac.uk
31. mailto:alexiei@dcs.shef.ac.uk
32. mailto:d.petrelli@sheffield.ac.uk
33. http://www.dcs.shef.ac.uk/%7Ealexiei/Documents/Posters/ISWC02.pdf
34. http://iswc.semanticweb.org/
35. http://www.dcs.shef.ac.uk/%7Ealexiei/Documents/Posters/ISWC2002.jpg
36. http://eprints.aktors.org/archive/00000116/
37. mailto:F.Ciravegna@dcs.shef.ac.uk
38. mailto:alexiei@dcs.shef.ac.uk
39. mailto:d.petrelli@sheffield.ac.uk
40. http://www.dcs.shef.ac.uk/%7Ealexiei/Documents/Posters/SIGIR2002.pdf
41. http://www.info.uta.fi/sigir2002/
42. http://www.dcs.shef.ac.uk/%7Ealexiei/Documents/Posters/SIGIR2002.jpg
43. mailto:alexiei@dcs.shef.ac.uk
44. http://www.dcs.shef.ac.uk/%7Ealexiei/Documents/Posters/ResearchRetreat.jpg

Alexiei Dingli, Fabio Ciravegna
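
Illustration: document sorting

The document sorting mechanism described under "Towards a Solution" can be illustrated with a minimal sketch. It assumes that the number of tags per document is already known and simply orders the pool so that the document with the fewest tags is presented first. The names Document, tag_count, sort_by_unexplored and next_document are hypothetical and are not part of Melita or Amilcare; how Melita actually counts tags or breaks ties is not stated on this page.

```python
from dataclasses import dataclass


@dataclass
class Document:
    """Hypothetical stand-in for a document awaiting annotation."""
    name: str
    tag_count: int  # assumed: number of tags currently found in the document


def sort_by_unexplored(documents):
    """Order documents so that those with the fewest tags come first.

    Assumption: few tags means the document covers areas of the domain
    for which no rules have been learned yet.
    """
    # sorted() is stable, so documents with equal counts keep their order.
    return sorted(documents, key=lambda doc: doc.tag_count)


def next_document(documents):
    """Return the document to present next, or None if the pool is empty."""
    ranked = sort_by_unexplored(documents)
    return ranked[0] if ranked else None


if __name__ == "__main__":
    pool = [
        Document("seminar-01.txt", tag_count=7),
        Document("seminar-02.txt", tag_count=1),
        Document("seminar-03.txt", tag_count=4),
    ]
    print(next_document(pool).name)  # prints seminar-02.txt
```

In the workflow described above, the tag counts would be refreshed and the pool re-sorted after every annotation, so the choice of next document follows what the learner has covered so far.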