Login/New-Account | Search | Submit a Story! | Greplaw!??
 
GrepLaw
- About
- FAQ
- Discussions
- Messages
- Topics
- Authors

- Preferences
- Older Stuff
- Past Polls
- Submit Story
- XML/RSS

GrepLaw
This site is a production of the Berkman Center for Internet & Society. Please email if you have questions, contributions, or ideas about improving this site.

F & F
Family

Friends

 
Google SafeSearch Goes Too Far
posted by filter_editor on Thursday April 10, @02:30PM
from the hardcore-research dept.
Censorship edelman writes "In 2000, Google introduced a feature called SafeSearch, intended to omit from Google's results sites with "pornography and explicit sexual content." My prior research on filtering systems suggests that all systems block considerably more content than their stated rules suggest, and with that in mind I set out to evaluate SafeSearch's accuracy.

My full report is now available:
   "Empirical Analysis of Google SafeSearch".

The research indicates that Google omits at least tens of thousands of web pages without any sexually-explicit content, whether graphical or textual. SafeSearch is easily confused by ambiguous words in web page titles--like "Hardcore Visual Basic Programming," a web page that describes intense programming for experts, without any sexually-explicit content whatsoever."

More below...


SafeSearch also makes mistakes that are harder to understand--like filtering the National Middle School Association (nmsa.org) and even the front page of Northeastern University (neu.edu), not to mention numerous sites operated by US federal, state, and local governments. Among searches on subjects such as reproductive health, SafeSearch allows some results but not others in a way that seems essentially random; it is difficult to construct a rational non-arbitrary basis for which pages are allowed and which are blocked. See highlights of pages omitted from SafeSearch seemingly inconsistent with SafeSearch's stated filtering policy.

In addition to providing a listing of specific URLs excluded from Google SafeSearch, I have provided a testing system to let users quickly determine whether a given URL is excluded from SafeSearch, and to determine, for a given keyword search, which ordinary Google results are excluded by SafeSearch.

Ben Edelman
Berkman Center for Internet & Society
Harvard Law School

Finkelstein: Edelman Case Docs Up | Swedish University Blocks P2P  >

 

 
GrepLaw Login
Nickname:

Password:

[ Create a new account ]

Related Links
  • The Berkman Center
  • highlights
  • a testing system
  • Ben Edelman
  • Berkman Center for Internet & Society
  • Harvard Law School
  • edelman
  • Google
  • prior research on filtering systems
  • "Empirical Analysis of Google SafeSearch"
  • More on Censorship
  • Also by filter_editor
  • This discussion has been archived. No new comments can be posted.
    Google SafeSearch Goes Too Far | Login/Create an Account | Top | Search Discussion
    Threshold:
    The Fine Print: The following comments are owned by whoever posted them. We are not responsible for them in any way.

    Humanity has the stars in its future, and that future is too important to be lost under the burden of juvenile folly and ignorant superstition. - Isaac Asimov

    [ home | contribute story | older articles | past polls | faq | authors | preferences ]