The dangers of webcrawled datasets
Journal article   Open access   Peer reviewed

G.B. Bell
First Monday, Vol.15(2)
2010
Published (Version of Record) Open Access

Abstract

This article highlights legal, ethical and scientific problems arising from the use of large experimental datasets gathered from the Internet, in particular image datasets. Such datasets are currently used in research on topics such as information forensics and image processing. The article strongly recommends against Webcrawling as a means of generating experimental datasets, and proposes safer alternatives.
