The first version of our metadata input schema (a DTD, to be specific) was created in 1999 to capture basic bibliographic information and facilitate matching DOIs to citations. Over the past 20 years the bibliographic metadata we collect has deepened, and we’ve expanded our schema to include funding information, license, updates, relations, and other metadata. Our schema isn’t as venerable as a MARC record or as comprehensive as JATS, but it’s served us well.
Crossref strives for balance. Different people have always wanted different things from us and, since our founding, we have brought together diverse organizations to have discussions—sometimes contentious—to agree on how to help make scholarly communications better. Being inclusive can mean slow progress, but we’ve been able to advance by being flexible, fair, and forward-thinking.
We have been helped by the fact that Crossref’s founding organizations defined a clear purpose in our original certificate of incorporation, which reads:
Bibliographic references in scientific papers are the end result of a process typically composed of: finding the right document to cite, obtaining its metadata, and formatting the metadata using a specific citation style. This end result, however, does not preserve the information about the citation style used to generate it. Can the citation style be somehow guessed from the reference string only?
TL;DR I built an automatic citation style classifier.
TL;DR On Wednesday, October 2nd, 2019 we discovered that we had accidentally pushed the main Crossref system as part of a docker image into a developer’s account on Docker Hub. The binaries and configuration files that made up the docker image included embedded passwords and API tokens that could have been used to compromise our systems and infrastructure. When we discovered this, we immediately secured the repo, changed all the passwords and secrets, and redeployed the system code.
We’ve all been subject to floods of conference invitations, it can be difficult to sort the relevant from the not-relevant or (even worse) sketchy conferences competing for our attention. In 2017, DataCite and Crossref started a working group to investigate creating identifiers for conferences and projects. Identifiers describe and disambiguate, and applying identifiers to conference events will help build clear durable connections between scholarly events and scholarly literature.
Chaired by Aliaksandr Birukou, the Executive Editor for Computer Science at Springer Nature, the group has met regularly over the past two years, collaborating to create use cases and define metadata to identify and describe conference series and events.
2019 has been busy for the Community Outreach Team; our small sub-team travels far and wide, talking to members around the world to learn how we can better support the work they do. We run one-day LIVE local events alongside multi-language webinars, with the addition of a new Community Forum, to better support and communicate with our global membership.
This year we held a publisher workshop in London in collaboration with the British Library in February to talk about all things metadata and Open Access, before heading over to speak to members in Kyiv in March at the National Technical University of Ukraine.
2019 Board Election The annual board election is a very important event for Crossref and its members. The board of directors, comprising 16 member organizations, governs Crossref, sets its strategic direction and makes sure that we fulfill our mission. Our members elect the board - its “one member one vote” - and we like to see as many members as possible voting. We are very pleased to announce the 2019 election slate - we have a great set of candidates and an update to the ByLaws addressing the composition of the slate to ensure that the board continues to be representative of our membership.
This month we have officially released a new version of our input metadata schema. As well as walking through the latest additions, I’ll also describe here how we’re starting to develop a new streamlined and open approach to schema development, using GitLab and some of the ideas under discussion going forward.
I’m happy to announce that Bryan Vickery has joined Crossref today as our new Director of Product. Bryan has extensive experience developing products and services at publishers such as Taylor & Francis, where he led the creation of the open-access platform Cogent OA. Most recently he was Managing Director of Research Services at T&F, including Wizdom.ai after it was acquired.
The official countdown to PIDapalooza 2020 begins here! It’s 163 days to go till our flame-lighting opening ceremony at the fabulous Belem Cultural Center in Lisbon, Portugal. Your friendly neighborhood PIDapalooza Planning Committee—Helena Cousijn (DataCite), Maria Gould (CDL), Stephanie Harley (ORCID), Alice Meadows (ORCID), and I—are already hard at work making sure it’s the best one so far!