Mission | Objectives | Plans | Topics | Projects | Services | Resources | Applications | Publications | Course | Links | People | Contracts | Contact

Mission

To advance the semantic analysis of natural language by the use of innovative computational methods.

Objectives/Goals

  • To promote the application of the highest-standard, most up-to-date Machine Learning techniques to the analysis of natural language text.
  • To act as a consulting and service agent to the language industry on methods for data mining natural language.
  • To develop generic software to support a wide range of applications for processing text.
  • To investigate and develop new methods of analysis of natural language.
  • To foster relationships with other computational linguists around the world to allow for collaboration and sharing of resources and skills.
  • To adapt and apply the methods of Minimum Message Length (MML) encoding to a range of computational linguistic problems.

Strategic Plans

  • To assemble a repository of the best software available for the analysis of natural language.
  • To develop a generic system for the specification of semantic knowledge and apply it as markup of target texts.
  • To provide critical reviews of major topics in our areas of strength, such as unsupervised and supervised learning methods for POS tagging and word sense disambiguation.
  • To develop approaches to Computational Linguistics that explicitly exploit the functionality of supercomputers.
  • To develop a Workflow Methodology for the creation and automatic revision and regeneration of linguistic resources requiring both paper and multimedia publication.
  • To provide a consulting service to researchers who wish to have data analysed by our methods.


Background Topics

These topics or themes are important to us because they influence how we work systematically within the wider field of natural language processing. In some of these areas we have specialised expertise; in others we follow developments without having explicit expertise ourselves.

Machine Learning:

Markup:
  • XML Standards
  • SGML Standards
Tagging:
  • Application Tagsets
  • Morphology
  • Machine Readable Dictionaries
  • Lexical Databases


Research Projects

Services

  • Process Data with Data Mining Tools
  • Develop Data Mining Tools
  • Collaborate on Joint Development and Analysis


Resources

Data Mining Tools:
  • Supervised Learning: Decision Trees and Decision Graphs - DTREES & DGRAPH
  • Unsupervised Learning: (Taxonomies) SNOB
  • Modelling: Probabilistic Finite State Automata - PFSA

Storage:

  • Web Database Management Systems

Processing:

  • Supercomputer Processing

Language Processing:

  • English
  • Basque
  • Chinese

Literature:


Applications

  • Measuring Relative Distance Between Natural Languages (Beijing and Cantonese Dialects)
  • Effectiveness of Psychotherapy
  • Part-of-Speech Tagging
  • Semantic Tagging
  • Systemic Functional Linguistic Analysis
  • Computer-aided Qualitative Data Analysis
  • Basque Multimedia Grammar and Dictionary
  • Basque Dictionary Search Engine on the Web (The site, which contains 6 searchable Basque dictionaries, is still under construction and is password-protected for limited use. If you are interested in using the site for research purposes, please contact us at the addresses at the bottom of the page.)

Publications

  • (2002) Patrick, J with Whitelaw, C. "Orthographic Tries in Language Independent Named Entity Recognition". Proceedings of CoNLL-2002, Taiwan.
  • (2002) Patrick, J with Whitelaw, C. "Orthographic Tries in Language Independent Named Entity Recognition". Australasian Natural Language Processing Workshop, Canberra, Australia, pp 1-8, ISBN 1 86408 800 1.
  • (2002) Patrick, J with Palko, D., Munro, R., & Zappavigna, M. "User Driven Example-Based Training for Creating Lexical Knowledgebases". Australasian Natural Language Processing Workshop, Canberra, Australia, pp 17-24, ISBN 1 86408 800 1.
  • (2001). Patrick, J with Goyal, I., "Boosted Decision Graphs for NLP Learning Tasks". In: Proceedings of CoNLL-2001, Toulouse, France. http://lcg-www.uia.ac.be/conll2001/clauses/
  • (2001). Patrick, J with S. Anthony, H. Qiao, C. Charles. "Computational semantic analysis for appraising change in language usage", (Eds.) H Bunt, I van der Sluis, E. Thijsse, 4th International Workshop on Computational Semantics, IWCS-4. pp468-472.
  • (2001). Patrick, J with P. Carter, & F. Deane, "EXCOVE and using videos in knowledge elicitation". International Journal of Human Computer Studies, 54 (3), 301-317.
  • (2000). Patrick, J with J. Zhang & X. Artola-Zubillaga, "An Architecture and Query Language for a Federation of Heterogeneous Lexical and Dictionary Databases", Computers and the Humanities, 34: 393-407.
  • (2000). Patrick, J with Balnaves, E. Kam, T. Ler, D. and Yeates, T. "A Case Study of Multimedia Architecture Design: Workflow and Version Control for Regeneration of Multimedia Systems". Grundy, J. & Han, J. (eds.). 3rd Australian Workshop on Software and System Architectures, pp88-99.
  • (1999). Patrick, J with Azkue, J.J., Bidaurrazaga, A., "Gorputza, osasuna eta gaixoa Azkuerengan (I): gorputz atalen izendapena." Euskera, XLIV: 507-576.
  • (1999). Patrick, J, "Tagging psychotherapeutic interviews for linguistic analysis". Proc of Workshop on Towards Standards and Tools for Discourse Tagging. Assoc for Comp Linguistics: New Brunswick. 58-64.
  • (1998). Patrick, J with A.Raman & P. Andreae, "A Beam Search Algorithm for PFSA Inference", Pattern Analysis & Applications. 1:121-129. Springer-Verlag London Ltd.
  • (2001). Patrick, J, A Student Grammar of Euskera, xv+pp400+xxv, Lincom Europa: Munich.
  • (2001) Patrick, J with Hossain, L and Rashid, M.A. Enterprise Resource Planning: Opportunities and Challenges, Idea Group Publishing, ISBN:193070836X.
  • (2002) Patrick, J with Palko, D., Munro, R., & Zappavigna, M. "Inferring Semantic Structure from Format". Digital Resources in the Humanities (ed.) C. Cole, Sydney, 2002 (in the press).
  • (2001). Patrick, J with Hossain, L and Rashid, M.A. "The Evolution of ERP Systems: A Historical Perspective", in Enterprise Resource Planning: Opportunities and Challenges, (eds) Patrick, J., Hossain, L & Rashid, M.A. Idea Group Publishing.
  • (2001). Patrick, J with Balnaves, E., Kam, T., Ler, D., Yeates, T., "A Case Study of Workflow and Version Control for Regeneration of Multimedia Systems", Computing Arts 2001: Digital Resources for Research in the Humanities, Uni. Sydney, http://setis.library.usyd.edu.au/drrh2001/
  • (2001). Patrick, J with Balnaves, E. "Managing Multimedia Content Databases: toward a Model for Content Management", Computing Arts 2001: Digital Resources for Research in the Humanities, Uni. Sydney, http://setis.library.usyd.edu.au/drrh2001/
  • (2001). Patrick, J with Palko, D. & Khan, A. "The Inductive Inference of Structure in Text Streams", Computing Arts 2001: Digital Resources for Research in the Humanities, Uni. Sydney, http://setis.library.usyd.edu.au/drrh2001/
  • (2001). Patrick, J with Anthony, S. "Discourse Semantics for the Analysis of Change in Language", Computing Arts 2001: Digital Resources for Research in the Humanities, Uni. Sydney, http://setis.library.usyd.edu.au/drrh2001/
  • (1998). Patrick, J with P. Carter and F.P. Deane, "Managing and analysing multiple expert commentary of videotaped psychodrama sessions using FERAL", CIP98 Computers in Psychology, 15-17th April, York, UK.
  • (1998) Patrick, J with E. Heinrich & E. Kemp, "A multimedia information system to support the analysis of human behaviour recorded on video", CIP98 Computers in Psychology, 15-17th April, York, UK.


Course

Links

 


People

Members:
  • Stephen Anthony, PhD Scholar
  • Hong Nguyen Pham, PhD Scholar
  • Prof. Jon Patrick, Director
  • Casey Whitelaw, PhD Scholar
  • Maria Herke Couchman, PhD Scholar
  • Michele Zappavigna-Lee, PhD Scholar
  • Ari Chanen, PhD Scholar
  • Toby Hawker, PhD Scholar
  • Denis Uy Dung Nguyen, Masters Scholar
  • Andrea Stern, Senior Lecturer
  • Matthew Honnibal, PhD Scholar
  • Will Radford, Research Assistant
  • John Drake, Honours Scholar
  • Tim Yeates, Honours Scholar

 

Collaborators:

  • Dr. David Dowe, Monash University
  • A/Prof. David Powers, Flinders University
  • Dr. Anand Raman, SRI
  • Prof Jim Martin, Senior Researcher
  • Prof Christian Matthiessen, Senior Researcher
  • Dr Jane Simpson, Senior Researcher

 

Contracts

Capital Markets Co-operative Research Centre (CMCRC)
The development of a workbench for testing and calibrating document classification strategies including machine learners and feature selections.


Australian Securities and Investments Commission (ASIC)
The development of a document classification and search engine technology for identifying financial scams on the Internet.

 


Contact

Postal Address:

Sydney Language Technology Research Group
School of Information Technologies
University of Sydney
NSW 2006, AUSTRALIA

 

Visitor's Address:

Tel: +61 2 9351 4174
Fax: +61 2 9351 3838





Email webmaster. Last updated on September 8, 2005.
