-
Notifications
You must be signed in to change notification settings - Fork 4
New issue
Have a question about this project? # for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “#”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? # to your account
Pull license information from CrossRef #12
Comments
Any progress on this data service? |
Ping. Moving to Phase 1B milestone. |
API call is
So, for example:
See license section of resulting JSON See the API documentation for more details. Report problems at the issue tracker |
Thanks, Geoffrey. |
Seems like Hindawi is fully compliant, others a long way off: Another example with license info http://api.crossref.org/works/10.1155/2014/945364
|
Hindawi is always pretty agile in adopting new features as they control all their own tech. Other publishers will start submitting this info soon. This hockey-stick pattern is typical for CrossRef initiatives as many publishers need to modify their third-party production systems before they can supply the data in bulk. I know most of the bigger publishers (Elsevier, Springer, PLOS, T&F) are working on this now. I am guessing that ~ half CrossRef metadata will have this info in the next 6-9 months. |
@gbilder Good to hear, thanks for the update! I think we're all coming to this realistically too--nothing will happen over night. Indeed there are many parties that have to change systems and process on their own respective schedules. Curious to hear if you have any info on sources of scraped license data. |
Nothing specific- just that scraping is always fragile and error-prone. But I do it myself in the absence of an API or metadata, so I can hardly complain. I suppose I would generally advise that one try several passes- first through any existing APIs (.e.g CrossRef/DataCite), second through screen scraping, third through supervised screen scraping (allow human to confirm)- lastly through manual updating. I expect that relatively quickly, the last three techniques will become fallback exceptions. At least for formal scholarly articles. |
Ah, indeed, I was thinking that at the very least scraped data could be helpful for naive verification of publisher-submitted license data. What's the hit rate? What is the relative coverage? It's possible others have aggregated some of this data already, would be interesting to see. |
You can see coverage fairly easily: compare: http://api.crossref.org/members/98 |
Ah, cool, that is handy. However, I meant the relative coverage between the license data that is available (and discover-able) via public access (scrape) versus the coverage of submitted data from the publisher. Rather than the coverage relative between the submitted license data and the works registered with DOIs by those publishers. Perhaps an exercise in futility, but it could give you a better idea of the range of articles for which there is currently no easily obtainable license information, and for which publisher-submissions would reveal new information, and the rate it changes over time. |
@gbilder The link http://participation.labs.crossref.org/features/tdm provided above to track progress on providing license information does not seem to be persistent. Any pointers on how to get an update? |
Hmm. W had a server die and are still migrating over links. You can get same data via API like this: http://api.crossref.org/members/78/works?filter=has-license:true,has-full-text:true&rows=0 We will fix the link ASAP. |
@gbilder any news on this? The API call is nice but not very handy to get an overview of the progress across publishers. |
Just had a chat with @gbilder who pointed me to For instance, To check performance, see http://search.crossref.org/help/status . pinging @wrought @notconfusing |
They plan to have the info available in (Northern) spring 2014.
The text was updated successfully, but these errors were encountered: