SALS-SIG Research Seminar

A Framework for Text Categorization


Speaker:

Ken Williams

Web Engineering Group, University of Sydney
Date: Monday, 4th November 2002
Time: 11:00--12:30
Place: ICS Seminar Room (E6A 357) Building E6A, Macquarie University

Abstract:

Text Categorization (TC) emerged in the early '90s as an active area of academic study, motivated by strong application needs. TC is a broad-based discipline, drawing inspiration from the Machine Learning, Information Retrieval, and Linguistics communities. It is seen as a key component of Knowledge Management in large businesses, as well as a helpful tool for individuals and smaller organizations.

The goal of my research is to create a reusable toolset for TC in the form of a software framework, drawing on the most suitable TC methods from published research and demonstrating their usefulness in real-world applications. The framework is being implemented as an object-oriented class hierarchy in Perl, which allows for extremely rapid application development or integration with existing applications. The seminar will consist of an introduction to the field of TC, a discussion of the design of the framework, and example applications.

About the speaker:

Ken Williams is studying Document Categorization in the Web Engineering Group at the University of Sydney. He received his undergraduate degree from Swarthmore College in Pennsylvania, USA, where he studied mathematics and music. He has worked as a Perl consultant and mathematics teacher. He is the co-author of "Embedding Perl in HTML with Mason", an upcoming book from O'Reilly and Associates.


Parking: Visitors requiring a parking pass are asked to contact us at least one working day before the seminar.

Enquiries: sals@ics.mq.edu.au

Last modified: 1st October 2002