on March 28, 2013 by Lindsay Marshall in tech, Comments (0)

Features and moans.

Monitoring the URLs that people submit to Greycite helps me find new sources of metadata, and, of course, new quirks and ways that people ignore standards. So now Greycite supports title tags that are embedded in the body rather than the head (I’m looking at you phdcomics (http://www.phdcomics.com/)), Parsely metadata (http://www.parsely.com/api/crawler.html) which is supported by some publications such as The Atlantic (http://www.theatlantic.com/), and and the use of a “title” meta tag as used by the American University Law Review (http://www.aulawreview.org/) to encode paper titles (they don’t have any other useful metadata though). Interestingly Squidoo (http://www.squidoo.com/) appears to have supported Parsely in the past but no longer does so for some reason.

Bibliography

No Comments

Leave a comment

Login