Extreme Datamining
Details
Serval ID
serval:BIB_266CF8230A3B
Type
Inproceedings: an article in a conference proceedings.
Collection
Publications
Institution
Title
Extreme Datamining
Title of the conference
Between Data Science and Applied Data Analysis - Proceedings of the 26th Annual Conference of the Gesellschaft für Klassifikation e.V., University of Mannheim, July 2002
Publisher
Springer
Address
Mannheim, Germany
ISBN
978-3-540-40354-8
Publication state
Published
Issued date
2003
Peer-reviewed
Oui
Editor
Schader M., Gaul W., Vichi M.
Series
Studies in Classification, Data Analysis, and Knowledge Organization
Pages
387-394
Language
english
Abstract
In recent years there have been a number of developments in the datamining techniques used in the analysis of terrabyte-sized logfiles resulting from Internet-based applications. The information which these datamining techniques provide allow knowledge engineers to rapidly direct business decisions. Current datamining methods however, are generally efficient only in the cases when the information obtained in the logfiles is close to the average. This means that in cases where non-standard logfiles (extreme data) are being studied, these methods provide unrealistic and erroneous results. Non-standard logfiles often have a large bearing on the analysis of web applications, the information which they provide can impact on new or even well established services. In this paper a recent Extreme Value Theory methodology is applied as a unique toolkit to describe, understand and predict the non-standard fluctuations as discovered in real-life Internet-sourced log data.
Web of science
Create date
23/08/2011 8:53
Last modification date
20/08/2019 13:05