The Open APC initiative releases datasets on fees paid for open access journal articles by universities and research institutions under an open database license. Open APC, which is also supported by the DINI Working Group Electronic Publishing, is located at Bielefeld University Library. Since October 2015, Open APC has been part of the INTACT project.
The main place where Open APC collects and maintains its data is GitHub, where the core data file is kept and edited in CSV format. This site and its open backend were established to improve accessibility and re-use of the data. As a result, there are three ways to access the Open APC dataset, which differ in their trade-off between flexibility and ease of handling:
The most basic way to access the Open APC data is the main APC file on GitHub. It is a CSV file, meaning that it should be easily processable by a wide range of tools and programming languages. A schema description of the file can be found here.
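As a rough illustration of how the CSV file could be processed, the following Python sketch sums the fees per publisher using only the standard library. The column names (institution, period, euro, publisher) and the sample rows are assumptions made for this example; the schema description is the authoritative source for the actual column names.

```python
import csv
import io
from collections import defaultdict

# Tiny made-up sample standing in for the real APC file.
# The column names are assumptions; check the schema description.
SAMPLE_CSV = """institution,period,euro,publisher
Example University A,2014,1200.50,Publisher X
Example University A,2014,800.00,Publisher Y
Example University B,2013,1500.00,Publisher X
"""


def total_by_publisher(csv_text, year=None):
    """Sum the 'euro' column per publisher, optionally for a single year."""
    totals = defaultdict(float)
    for row in csv.DictReader(io.StringIO(csv_text)):
        # Skip rows from other years when a year filter is given.
        if year is not None and row["period"] != str(year):
            continue
        totals[row["publisher"]] += float(row["euro"])
    return dict(totals)


if __name__ == "__main__":
    print(total_by_publisher(SAMPLE_CSV, year=2014))
```

To run this against the real data, the content of the downloaded CSV file would be passed in place of SAMPLE_CSV, with the column names adjusted as needed.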
Since the raw CSV data can be difficult to analyze without tool support, a more elegant method is querying the Open APC OLAP Server. This service implements an online analytical processing framework based on cubes, forming the backend of treemaps.intact-project.org.
Some example queries:
(How much did every individual publisher receive in 2014?)
(What journals did Bielefeld University pay for?)
(List detailed facts about every article Regensburg University paid for in 2013)
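Queries of this kind could also be assembled programmatically. The Python sketch below builds query URLs in the style of the HTTP API of the Cubes OLAP framework (an aggregate or facts endpoint with cut and drilldown parameters). Whether the Open APC server follows exactly this layout is an assumption, and the base URL, the cube name "openapc", the dimension names (period, publisher, institution) and the institution value are hypothetical placeholders.

```python
from urllib.parse import quote, urlencode

# Hypothetical placeholder; replace with the real OLAP server address.
BASE_URL = "https://olap.example.org"


def build_query_url(base_url, cube, action, cuts=None, drilldown=None):
    """Build a Cubes-style query URL.

    action    -- endpoint name, e.g. "aggregate" or "facts"
    cuts      -- dict mapping a dimension to the value it is restricted to
    drilldown -- dimension to group the aggregated result by
    """
    params = {}
    if cuts:
        # Several cuts are joined with "|" into a single parameter.
        params["cut"] = "|".join(f"{dim}:{value}" for dim, value in cuts.items())
    if drilldown:
        params["drilldown"] = drilldown
    url = f"{base_url.rstrip('/')}/cube/{quote(cube)}/{action}"
    if params:
        # Keep ":" and "|" readable; everything else is percent-encoded.
        url += "?" + urlencode(params, safe=":|")
    return url


if __name__ == "__main__":
    # Publisher totals for 2014 (assumed dimension names).
    print(build_query_url(BASE_URL, "openapc", "aggregate",
                          cuts={"period": "2014"}, drilldown="publisher"))
    # Article-level facts for one institution in 2013 (placeholder value).
    print(build_query_url(BASE_URL, "openapc", "facts",
                          cuts={"institution": "Example U", "period": "2013"}))
```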
Note: Queries return their results as JSON. If you want to view the results directly in your browser, it is highly advisable to use an add-on or extension that formats JSON properly, such as JSONView for Firefox and Chrome.
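Outside the browser, a JSON result can be made readable with a few lines of Python. The sketch below re-indents a JSON string using the standard library; the sample response and its field names are invented for illustration and do not claim to match the server's actual output.

```python
import json

# Invented sample response; the real field names may differ.
SAMPLE_RESPONSE = (
    '{"summary": {"amount_sum": 2000.5, "num_items": 2}, '
    '"cells": [{"publisher": "Publisher X", "amount_sum": 1200.5}]}'
)


def pretty(json_text):
    """Return the JSON text re-indented with sorted keys for easier reading."""
    return json.dumps(json.loads(json_text), indent=2, sort_keys=True,
                      ensure_ascii=False)


if __name__ == "__main__":
    print(pretty(SAMPLE_RESPONSE))
```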
The treemaps on this site are the easiest and most intuitive way to browse and inspect the Open APC data. You can view the data either for a single institution or for the whole dataset.