networks & systems laboratory> research> current projects> TEMA: the evolutionary meta-learning agent

TEMA: The Evolutionary Meta-learning Agent
Smart Internet Technology Research Group

Aims

- To design and develop a mechanism to facilitate machine learning among an arbitrary set of tasks.

- The system developed should allow various other applications to be “plugged in” to provide a solution to a range of arbitrary machine learning tasks.

Introduction

- The norm in current applications of machine learning requires expert knowledge of machine learning in order to select and implement algorithms that will support a task satisfactorily.

- Despite the availability of tools such as WEKA, C5 and MATLAB’s Neural Network Toolkit, the amount of duplicate work done over different machine learning tasks is still quite significant. This is primarily due to the tweaking of the algorithm implementation to improve results. The large number of algorithms available also magnifies this problem.

- To limit the knowledge a user requires and to reduce the redundancy involved in utilisating machine learning, an evolutionary meta-learning agent is proposed

Meta-learning

- The automation of the algorithm selection process requires an algorithm, called a meta-learning algorithm, that is capable of selecting an appropriate machine learning algorithm given the problem at hand. The figure below depicts the typical mechanics of such a meta-learning algorithm.

*click on image to enlarge

- Once trained, the meta-learner given a dataset (pertaining to the problems at hand) will predict the appropriate base-learner to employ. This base learner is then given the same dataset and produces an appropriate classifier.

Issues

- Aside from the selection of the meta-learner (and the pool of base-learners), several other issues must also be addressed in the development of the framework.

- How a machine learning problem is defined. To provide a plug-in that supplies machine learning solutions to a generic field of problems, the problem definition is paramount. Aside from the data, this includes information on application restrictions and other background information. The problem definition also has ramifications for the meta-learner as it must distinguish between both similar and varying problems in order to select an appropriate classifier.

- To make the agent evolutionary, not only must it be made incremental, but it must also be able to evolve with the development of new algorithms (as well as meta-algorithms). Essentially, careful planning is required when designing the agent so that it has all the flexibility required to facilitate fundamental change.

- To refine the accuracy of the meta-learner (in this case, the training and testing error on various data sets relative to the predicted classifier), the meta-data should be updated whenever possible. This means that the testing of the new datasets must be conducted on the various algorithms and their variations. This will lead to an enormous amount of computation. This means that the agent should be streamlined to be extremely efficient. It should also conduct such testing offline.

- A knowledge base must be accumulated so that the agent can at least act, albeit primitively. An alternative is to have a default procedure for algorithm selection.

 

Fundamental TEMA Architecture

- Currently, the standard process for selecting a machine learning algorithm and improving its performance is depicted below.

- The TEMA architecture illustrated below automates this process.

*click on image to enlarge

- The general idea is to accumulate meta-data in order to test and train a better meta-learner.

- Meta-data is generated and utilised by the meta-learner to select an appropriate machine learning algorithm. The choice made, along with the meta-data, is then stored for future use.

- The base-learner selected by the meta-learner is then trained using the input data set and an output classifier produced.

- When TEMA is not being used, its resources could be used to derive an improved meta-learner. Of course, this is only when new data has been accumulated.

Application of TEMA in IEMS

- The Intelligent Email Sorter or IEMS is an email client which aids the user by sorting email as it arrives.

- The need for an agent such as TEMA to cope with the machine learning needs of IEMS is two-fold. Firstly,different people organise their email quite differently. This means that different learning mechanisms may be achieve the best performance for different users. Secondly, the number of possible learning tasks in email is large.

- Essentially, the IEMS client will use a pre-defined protocol to specify a learning problem, which TEMA will then try to solve. The relationship has yet to be defined. One alternative is for TEMA to return the appropriate classifier to IEMS, while another is for TEMA to retain the classifier and perform the classifications on behalf of IEMS.

- If classification tasks are performed by TEMA, one of the notable advantage is that TEMA can evolve the learner (associated with the task from IEMS) as it accumulates new information from other tasks. However, this would mean that IEMS would incur an added overhead, which may reduce efficiency.

Contacts

Dr Daren Ler
Dr Irena Koprinska
Associate Professor Judy Kay
Dr Eric McCreath

*This research is supported by an Australian Postgraduate Award and a Smart Internet Technology CRC scholarship.
 
University of SydneyDesigned by eliu