Basser Seminar Series

Data Cleansing as a Transient Service

Speaker: Dr Mukesh Mohania
STSM and Senior Manager, IBM Research, India

Time: Thursday 6 May 2010, 4:00-5:00pm
Refreshments will be available from 3:30pm

Location: The University of Sydney, School of IT Building, Lecture Theatre (Room 123), Level 1

Abstract

Enterprises often have a transient need for data cleansing, which can be met by offering data cleansing as a transient service. Every time a data cleansing need arises, it should be possible to provision hardware, software and staff to accomplish the task and then dismantle the setup.

We present such a system, which uses virtualized hardware and software for data cleansing, and share actual experiences gained from building it. We use a cloud infrastructure to offer virtualized data cleansing instances that can be accessed as services. The system is scalable, elastic and configurable. Because each enterprise has unique needs, both the infrastructure and the cleansing algorithms must be customized to address them.

Speaker's biography

Mukesh Mohania received his Ph.D. in Computer Science & Engineering from the Indian Institute of Technology, Bombay, India in 1995. He is currently an STSM and senior manager at IBM Research - India, where he leads the Information Management research group. He has worked extensively in the areas of distributed databases, data warehousing, data integration, and autonomic computing. He has published more than 100 papers and filed more than 30 patents in these and related areas. He received best paper awards for his XML work at CIKM 2004 and for his data integration work at CIKM 2005.

He received an award from IBM Tivoli Software in 2004 for his research contribution to the Policy Management for Autonomic Computing product. He was also a recipient of the "Excellence in People Management" award at IBM India in 2007. He received the "Outstanding Innovation Award" from IBM Corporation in 2008 for his Context-Oriented Information Integration work, and a Technical Accomplishment Award in 2009 for his Policy work. He is an IEEE and ACM Distinguished Speaker.