A text mining approach for automatic taxonomy generation and text categorisation

Pais, N., Dotsika, F. and Shearer, J. 2007. A text mining approach for automatic taxonomy generation and text categorisation. Journal of Knowledge Management Practice. 8 (S1).

TitleA text mining approach for automatic taxonomy generation and text categorisation
AuthorsPais, N., Dotsika, F. and Shearer, J.
Abstract

The research presented in this paper investigates the use of a text mining approach for automatic taxonomy generation and text categorisation for the content management system of Alergoclínica, a private clinic of Dermatology and Allergies in São Paulo, Brazil. Text mining has been of interest for many years, but despite the ever-increasing range of text mining applications available, there are neither common standards nor shared evaluation criteria to enable comparison among the different approaches. Numerous problems are addressed by various groups, often using private data sets, so that it is virtually impossible to determine the quality, performance and scalability of the existing systems. Three text mining tools, selected against specific criteria, were investigated to determine their suitability in this real world environment. Surprisingly, the study shows that none are really effective for the task, though each gave some useful output. None could be recommended for full scale implementation.

JournalJournal of Knowledge Management Practice
Journal citation8 (S1)
ISSN1705-9232
YearMay 2007
PublisherLeadership Alliance Inc.
Web address (URL)http://www.tlainc.com/articlsi6.htm
Publication dates
PublishedMay 2007

Related outputs

Digital Transformation and Social Business: A Practice-Based Pathway Framework for SMEs
Dotsika, F. and Patrick, K. 2021. Digital Transformation and Social Business: A Practice-Based Pathway Framework for SMEs. International Journal of Research in Business and Management. 3 (2), pp. 1-16.

Blockchain applications for SME transformation: a pilot framework
Dotsika, F. 2019. Blockchain applications for SME transformation: a pilot framework . International Journal of Management and Applied Science. 5 (2), pp. 68-81.

Identifying trends and flows in Communication and Information Processing by means of keyword network analysis
Dotsika, F. and Watkins, A. 2017. Identifying trends and flows in Communication and Information Processing by means of keyword network analysis. 3rd International Conference on Communication and Information Processing. Tokyo, Japan 24 - 26 Nov 2017 ACM. https://doi.org/10.1145/3162957.3162990

Identifying potentially disruptive trends by means of keyword network analysis
Dotsika, F. and Watkins, A. 2017. Identifying potentially disruptive trends by means of keyword network analysis. Technological Forecasting & Social Change. 119, pp. 114-127. https://doi.org/10.1016/j.techfore.2017.03.020

Implementing a social intranet in a professional services environment through Web 2.0 technologies
Janes, S.H., Patrick, K. and Dotsika, F. 2014. Implementing a social intranet in a professional services environment through Web 2.0 technologies. The Learning Organization Journal. 21 (1), pp. 26-47. https://doi.org/10.1108/TLO-11-2012-0068

Collaborative KM for SMEs: a framework evaluation study
Dotsika, F. and Patrick, K. 2013. Collaborative KM for SMEs: a framework evaluation study. Information Technology & People. 20 (1), pp. 368-382. https://doi.org/10.1108/ITP-11-2012-0142

The next generation of the web: an organisational perspective
Dotsika, F. 2012. The next generation of the web: an organisational perspective. University of Westminster.

Semantic technologies:from niche to the mainstream of Web 3? A comprehensive framework for web Information modelling and semantic annotation
Dotsika, F. 2012. Semantic technologies:from niche to the mainstream of Web 3? A comprehensive framework for web Information modelling and semantic annotation. PhD thesis University of Westminster School of Electronics and Computer Science https://doi.org/10.34737/8z712

Semantic APIs: scaling up towards the Semantic Web
Dotsika, F. 2010. Semantic APIs: scaling up towards the Semantic Web. International Journal of Information Management. 30 (4), pp. 335-342. https://doi.org/10.1016/j.ijinfomgt.2009.12.003

Uniting formal and informal descriptive power: reconciling ontologies with folksonomies
Dotsika, F. 2009. Uniting formal and informal descriptive power: reconciling ontologies with folksonomies. International Journal of Information Management. 29 (5), pp. 407-415. https://doi.org/10.1016/j.ijinfomgt.2009.02.002

Web knowledge discovery trends: from semantic annotation to semantic APIs
Dotsika, F. 2009. Web knowledge discovery trends: from semantic annotation to semantic APIs. in: Dalkir, K. (ed.) Proceedings of the 6th International Conference on Intellectual Capital, Knowledge Management & Organizational Learning, School of Information Studies, McGill University, Montreal, Quebec, Canada, 1-2 October 2009 Reading Academic Publishing. pp. 314-320

Reconciling Web information classification approaches: the methods, the facts and the hype
Dotsika, F. 2008. Reconciling Web information classification approaches: the methods, the facts and the hype. in: O’Sullivan, K. (ed.) Proceedings of the 5th International Conference on Intellectual Capital, Knowledge Management & Organizational Learning, New York Institute of Technology, New York, USA, 9-10 October 2008 Reading Academic Publishing. pp. 137-144

Knowledge sharing: developing from within
Patrick, K. and Dotsika, F. 2007. Knowledge sharing: developing from within. The Learning Organization. 14 (5), pp. 395-406. https://doi.org/10.1108/09696470710762628

Interactive business development, capturing business knowledge and practice: a case study
McKelvie, G., Dotsika, F. and Patrick, K. 2007. Interactive business development, capturing business knowledge and practice: a case study. The Learning Organization. 14 (5), pp. 407-422. https://doi.org/10.1108/09696470710762637

Quality issues in Web information and knowledge management
Dotsika, F. 2007. Quality issues in Web information and knowledge management. in: Remenyi, D. (ed.) Proceedings of the 4th International Conference on Intellectual Capital, Knowledge Management & Organizational Learning, University Of Stellenbosch Business School, South Africa, 15-16 October 2007 Reading Academic Publishing. pp. 119-126

Towards the new generation of web knowledge
Dotsika, F. and Patrick, K. 2006. Towards the new generation of web knowledge. VINE: the journal of information and knowledge management systems. 36 (4), pp. 406-422. https://doi.org/10.1108/03055720610716665

Building knowledge management systems: a proposal for a reconsideration of the development process
Dotsika, F. and Patrick, K. 2006. Building knowledge management systems: a proposal for a reconsideration of the development process. in: KMAC 2006 Proceedings of the Knowledge Management Aston Conference,17 -18 July 2006 Birmingham OR Society.

Knowledge capture, sharing and maintenance in the semantic Web age: a framework proposal
Dotsika, F. and Patrick, K. 2005. Knowledge capture, sharing and maintenance in the semantic Web age: a framework proposal. in: Wenn, A. and Dhanda, K.K. (ed.) Proceedings of the 4th Annual ISOneWorld Conference and Convention: Enabling Executive Information Technology Competencies. March 30 - April 1, 2005, Las Vegas, NV, USA Washington, USA Information Institute.

From end-users to bots: the balancing act of web-based knowledge search and sharing
Dotsika, F. and Patrick, K. 2005. From end-users to bots: the balancing act of web-based knowledge search and sharing. in: Remenyi, D. (ed.) Proceedings of the ICICKM 2005: International Conference on Intellectual Capital, Knowledge Management and Organisational Learning Reading, UK Academic Conferences and Publishing International. pp. 143-150

A risky business retirement (for higher education)
Coakes, E., Bradburn, A., Shearer, J., Dotsika, F. and Barnett, N. 2005. A risky business retirement (for higher education). in: Khosrow-Pour, M. (ed.) Managing modern organizations with information technology: 2005 Information Resources Management Association International Conference, San Diego, California, USA, May 15-18, 2005 Hershey, USA Idea Group Publishing.

Knowledge creation and sharing mechanisms: from Heads (of Departments) to Hands (of staff)
Coakes, E., Bradburn, A., Shearer, J., Dotsika, F. and Burke, T. 2004. Knowledge creation and sharing mechanisms: from Heads (of Departments) to Hands (of staff). International Journal of Knowledge, Culture and Change Management. 4.

An interoperable, graphical environment for the capturing of medical information
Dotsika, F. and Watkins, A. 2003. An interoperable, graphical environment for the capturing of medical information. Technology and Health Care. 11 (5), pp. 305-306.

From data to knowledge in e-health applications: an integrated system for medical information modelling and retrieval
Dotsika, F. 2003. From data to knowledge in e-health applications: an integrated system for medical information modelling and retrieval. Medical Informatics and the Internet in Medicine. 28 (4), pp. 231-251. https://doi.org/10.1080/14639230310001617832

GISMoE: a graph-based information system modelling environment
Dotsika, F. and Watkins, A. 2003. GISMoE: a graph-based information system modelling environment. in: Hamza, M.H. (ed.) Intelligent systems and control Canada Acta Press.

Knowledge creation and sharing mechanisms: from head to hands
Coakes, E., Bradburn, A., Shearer, J. and Burke, T. 2003. Knowledge creation and sharing mechanisms: from head to hands. 4th European Conference on Knowledge Management. Oxford, UK 18-19 Sep 2003

Modelling medical operational knowledge for e-health applications
Dotsika, F. 2002. Modelling medical operational knowledge for e-health applications. Technology and Health Care. 10 (6), pp. 474-476.

Integrating web-based information systems: WWW and the functional model
Dotsika, F. and Watkins, A. 2002. Integrating web-based information systems: WWW and the functional model. in: Humza, M.H. (ed.) Proceedings of the Sixth IASTED International Conference on Internet and Multimedia Systems and Applications: IMSA '02 Canada Acta Press.

The impact of three key healthcare technology standards on evidence based healthcare practice
Narayana, J. and Dotsika, F. 2001. The impact of three key healthcare technology standards on evidence based healthcare practice. Technology and Healthcare. 9 (6), pp. 501-503.

XML and functional databases
Dotsika, F. and Watkins, A. 2001. XML and functional databases. in: Hamza, M.H. (ed.) Intelligent systems and control Calgary, Canada Acta Press.

Permalink - https://westminsterresearch.westminster.ac.uk/item/91qy6/a-text-mining-approach-for-automatic-taxonomy-generation-and-text-categorisation


Share this

Usage statistics

110 total views
0 total downloads
These values cover views and downloads from WestminsterResearch and are for the period from September 2nd 2018, when this repository was created.