You can edit almost every page by Creating an account and confirming your email.

SemOpenAlex

From EverybodyWiki Bios & Wiki


SemOpenAlex
File:Semopenalex-screenshot-2023.png
Screenshot of SemOpenAlex interface
Type of site
Scholarly Knowledge Graph
WebsiteSemOpenAlex.org
CommercialNo
Current statusActive
Content license
CC0 (Creative Commons Zero)

SemOpenAlex is an open RDF knowledge graph modelling the global scholarly landscape. Introduced in 2023, it transforms OpenAlex into a standards-compliant RDF graph. With over 26 billion triples, it covers publications, authors, institutions, journals, and scientific concepts, supporting advanced analytics, semantic publishing, and recommendation systems. The associated research paper received the ISWC Best Paper Award 2023 in the resource track, highlighting its impact.[1][2]

Overview

SemOpenAlex addresses challenges in navigating the growing volume of scientific literature by providing an interconnected, machine-actionable data structure.[1] It offers:

  • A SPARQL endpoint for semantic querying.
  • RDF dumps for bulk data access.
  • A semantic search interface for real-time exploration.
  • Knowledge graph embeddings for applications like recommendation systems.
  • Integration into the Linked Open Data (LOD) cloud with links to resources such as Wikidata and the Microsoft Academic Knowledge Graph.

Development and Hosting

Developed by Michael Färber, affiliated in 2023 with Karlsruhe Institute of Technology (KIT), and metaphacts GmbH, SemOpenAlex uses established vocabularies like Dublin Core (DCterms), FaBiO, and SKOS, adhering to FAIR principles.[1]

Key Statistics

As of 2023, SemOpenAlex contains:

  • 249 million publications.
  • 135 million authors.
  • 108,000 institutions.
  • 1.7 billion citations.

Applications

SemOpenAlex supports[1]:

  • Analytics: Enables large-scale research impact assessments and trend analysis.
  • Recommender Systems: Suggests publications, collaborators, and venues with explainability.
  • Semantic Publishing: Links publications to datasets and methods, enhancing scientific communication.
  • AI Integration: Provides reliable metadata for citation generation and scholarly LLMs.
  • Benchmarking: Serves as a resource for testing systems on large-scale, realistic knowledge graphs.

Licensing

The data is licensed under Creative Commons Zero (CC0), enabling unrestricted use. Source code is available on GitHub.

See Also

References

  1. 1.0 1.1 1.2 1.3 Färber, Michael; Lamprecht, David; Krause, Johan; Aung, Linn; Haase, Peter (2023). "SemOpenAlex: The Scientific Landscape in 26 Billion RDF Triples". Proceedings of the 22nd International Semantic Web Conference (ISWC'23). Athens, Greece. arXiv:2308.03671.
  2. "ISWC 2023 Awards". 8 November 2023. Retrieved 2024-12-01.

External Links



This article "SemOpenAlex" is from Wikipedia. The list of its authors can be seen in its historical and/or the page Edithistory:SemOpenAlex. Articles copied from Draft Namespace on Wikipedia could be seen on the Draft Namespace of Wikipedia and not main one.