Mining the Crowd
Abstract
Harnessing a crowd of Web users for data collection has recently become a wide-spread
phenomenon. A key challenge is that the human knowledge forms an open world and it is thus
difficult to know what kind of information we should be looking for. Classic databases have
addressed this problem by data mining techniques that identify interesting data patterns. These
techniques, however, are not suitable for the crowd. This is mainly due to properties of the
human memory, such as the tendency to remember simple trends and summaries rather than
exact details. Following these observations, we develop here a novel model for crowd mining.
We will consider in the talk the logical, algorithmic, and methodological foundations needed
for such a mining process, as well as the applications that can benefit from the knowledge
mined from crowd.