Clearview Forbids Users From Scraping Its Database Of Images It Scraped From Thousands Of Websites
from the don't-scrape-me-bro dept
The use of automated systems or software to extract the whole or any part of the Service or Website, the Information or data on or within the Service or the Website, including image search results or source code, for any purposes (including uses commonly known as “scraping”) is strictly prohibited.
Pretty sure a bunch of the sites scraped by Clearview have similar clauses in their terms of use. And if Clearview doesn't believe those terms should be honored, it shouldn't expect anyone to give it the respect it refuses to extend to others. I don't think anyone else should necessarily be in possession of everything in Clearview's facial recognition database, but I do think someone needs to scrape the shit out of it on sheer principle.
Also bundled in this package of public records is Clearview's laughable "accuracy" test. It compares itself to Amazon's Rekognition and that product's highly publicized failure. When the ACLU tested Amazon's tech, the software misidentified several DC legislators as criminals, especially those who weren't white and male.
Clearview touts its own success in this document [PDF], which covers a non-independent test of its AI performed in 2019. Here are the results:
The test compared the headshots from all three legislative bodies against Clearview’s proprietary database of 2.8 billion images (112,000 times the size of the database used by the ACLU). The Panel determined that Clearview rated 100% accurate, producing instant and accurate matches for every one of the 834 federal and state legislators in the test cohort.
LOL. This is proof of nothing. Anyone with access to a reverse image search could perform this test with the same accuracy. While Amazon's AI was tested against arrestees' mugshots, Clearview's was tested against photos and info scraped from social media profiles and public websites. Of course it was able to positively identify politicians, most of whom maintain multiple social media accounts and websites. It would only be notable if the AI had failed to perform this simple task given the wealth of information it had to work with.
In conclusion, Clearview sucks. Its tech is unproven and its policy on scraping is the apex of hypocrisy.
On the other hand, the company seems to be harvesting criticism as fast as it's harvesting web content, so the prognosis on its continued survival remains refreshingly bleak.
Filed Under: facial recognition, hypocrisy, scraping
Companies: clearview