Journal article
The dangers of webcrawled datasets
First Monday, Vol.15(2)
2010
Abstract
This article highlights legal, ethical and scientific problems arising from the use of large experimental datasets gathered from the Internet — in particular, image datasets. Such datasets are currently used within research into topics such as information forensics and image processing. This paper strongly recommends against Webcrawling as a means for generating experimental datasets, and proposes safer alternatives.
Details
- Title
- The dangers of webcrawled datasets
- Authors/Creators
- G.B. Bell (Author/Creator)
- Publication Details
- First Monday, Vol.15(2)
- Publisher
- University of Illinois
- Identifiers
- 991005540266607891
- Copyright
- Creative Commons Attribution 2.5 UK: Scotland License
- Murdoch Affiliation
- School of Information Technology
- Language
- English
- Resource Type
- Journal article
- Note
- “The dangers of Webcrawled datasets” by Graeme Bell is licensed under a Creative Commons Attribution 2.5 UK: Scotland License. Permissions beyond the scope of this license may be available at http://graemebell.net.
Metrics
359 File views/ downloads
75 Record Views