SemOpenAlex
| File:Semopenalex-screenshot-2023.png Screenshot of SemOpenAlex interface | |
Type of site | Scholarly Knowledge Graph |
|---|---|
| Website | SemOpenAlex.org |
| Commercial | No |
| Current status | Active |
Content license | CC0 (Creative Commons Zero) |
SemOpenAlex is an open RDF knowledge graph modelling the global scholarly landscape. Introduced in 2023, it transforms OpenAlex into a standards-compliant RDF graph. With over 26 billion triples, it covers publications, authors, institutions, journals, and scientific concepts, supporting advanced analytics, semantic publishing, and recommendation systems. The associated research paper received the ISWC Best Paper Award 2023 in the resource track, highlighting its impact.[1][2]
Overview
SemOpenAlex addresses challenges in navigating the growing volume of scientific literature by providing an interconnected, machine-actionable data structure.[1] It offers:
- A SPARQL endpoint for semantic querying.
- RDF dumps for bulk data access.
- A semantic search interface for real-time exploration.
- Knowledge graph embeddings for applications like recommendation systems.
- Integration into the Linked Open Data (LOD) cloud with links to resources such as Wikidata and the Microsoft Academic Knowledge Graph.
Development and Hosting
Developed by Michael Färber, affiliated in 2023 with Karlsruhe Institute of Technology (KIT), and metaphacts GmbH, SemOpenAlex uses established vocabularies like Dublin Core (DCterms), FaBiO, and SKOS, adhering to FAIR principles.[1]
Key Statistics
As of 2023, SemOpenAlex contains:
- 249 million publications.
- 135 million authors.
- 108,000 institutions.
- 1.7 billion citations.
Applications
SemOpenAlex supports[1]:
- Analytics: Enables large-scale research impact assessments and trend analysis.
- Recommender Systems: Suggests publications, collaborators, and venues with explainability.
- Semantic Publishing: Links publications to datasets and methods, enhancing scientific communication.
- AI Integration: Provides reliable metadata for citation generation and scholarly LLMs.
- Benchmarking: Serves as a resource for testing systems on large-scale, realistic knowledge graphs.
Licensing
The data is licensed under Creative Commons Zero (CC0), enabling unrestricted use. Source code is available on GitHub.
See Also
References
- ↑ 1.0 1.1 1.2 1.3 Färber, Michael; Lamprecht, David; Krause, Johan; Aung, Linn; Haase, Peter (2023). "SemOpenAlex: The Scientific Landscape in 26 Billion RDF Triples". Proceedings of the 22nd International Semantic Web Conference (ISWC'23). Athens, Greece. arXiv:2308.03671.
- ↑ "ISWC 2023 Awards". 8 November 2023. Retrieved 2024-12-01.
External Links
This article "SemOpenAlex" is from Wikipedia. The list of its authors can be seen in its historical and/or the page Edithistory:SemOpenAlex. Articles copied from Draft Namespace on Wikipedia could be seen on the Draft Namespace of Wikipedia and not main one.
