SemOpenAlex
Type of site | Scholarly Knowledge Graph |
---|---|
URL | SemOpenAlex.org |
Commercial | No |
Current status | Active |
Content license | CC0 (Creative Commons Zero) |
SemOpenAlex is an open RDF knowledge graph that models the global scholarly landscape. Introduced in 2023, it transforms data from OpenAlex into a standards-compliant RDF graph. With over 26 billion triples, it covers publications, authors, institutions, journals, and scientific concepts, and is intended to support analytics, semantic publishing, and recommendation systems. The associated research paper received the Best Paper Award in the resource track at ISWC 2023.[1][2]
Overview
SemOpenAlex addresses challenges in navigating the growing volume of scientific literature by providing an interconnected, machine-actionable data structure.[1] It offers:
- A SPARQL endpoint for semantic querying.
- RDF dumps for bulk data access.
- A semantic search interface for real-time exploration.
- Knowledge graph embeddings for applications like recommendation systems.
- Integration into the Linked Open Data (LOD) cloud with links to resources such as Wikidata and the Microsoft Academic Knowledge Graph.
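As a minimal sketch of how the SPARQL endpoint listed above might be addressed, the following Python snippet builds a GET request URL in the style of the SPARQL 1.1 Protocol. The endpoint address `https://semopenalex.org/sparql` and the helper name `build_request_url` are assumptions made for illustration, and the query is deliberately generic: it only asks for resources that carry a Dublin Core title and is not taken from the project's documentation.

```python
from urllib.parse import urlencode

# Assumed endpoint address (not confirmed by the article); check the
# project website for the actual URL before sending requests.
ENDPOINT = "https://semopenalex.org/sparql"

# Generic illustrative query: up to five resources with a Dublin Core title.
QUERY = """\
PREFIX dcterms: <http://purl.org/dc/terms/>
SELECT ?work ?title
WHERE { ?work dcterms:title ?title }
LIMIT 5
"""


def build_request_url(endpoint: str, query: str) -> str:
    """Return a GET URL that passes the query in the 'query' parameter."""
    return endpoint + "?" + urlencode({"query": query})


if __name__ == "__main__":
    # Only prints the URL; no network request is made here.
    print(build_request_url(ENDPOINT, QUERY))
```

The resulting URL could then be fetched with any HTTP client, typically with an `Accept: application/sparql-results+json` header to request JSON results.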
Development and Hosting
SemOpenAlex was developed by Michael Färber, who was affiliated with the Karlsruhe Institute of Technology (KIT) in 2023, and by metaphacts GmbH. It reuses established vocabularies such as Dublin Core (DCTerms), FaBiO, and SKOS, and adheres to the FAIR principles.[1]
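To illustrate what reusing these vocabularies can look like in practice, the sketch below serializes three statements as N-Triples using one property each from DCTerms, FaBiO, and SKOS. The subject IRIs under `example.org`, the literal values, and the helper names are hypothetical; the properties were chosen for illustration and are not claimed to match the actual SemOpenAlex schema. For brevity, all objects are written as plain literals without datatypes.

```python
# Namespace IRIs of the three vocabularies named in the text.
DCTERMS = "http://purl.org/dc/terms/"
FABIO = "http://purl.org/spar/fabio/"
SKOS = "http://www.w3.org/2004/02/skos/core#"


def escape_literal(text: str) -> str:
    """Escape backslashes, quotes, and line breaks for an N-Triples literal."""
    return (
        text.replace("\\", "\\\\")
        .replace('"', '\\"')
        .replace("\n", "\\n")
        .replace("\r", "\\r")
    )


def to_ntriple(subject: str, predicate: str, literal: str) -> str:
    """Serialize one (IRI, IRI, plain literal) statement as an N-Triples line."""
    return f'<{subject}> <{predicate}> "{escape_literal(literal)}" .'


# Hypothetical resources; real identifiers in the graph will differ.
WORK = "https://example.org/work/W1"
CONCEPT = "https://example.org/concept/C1"

triples = [
    (WORK, DCTERMS + "title", 'An "example" paper'),
    (WORK, FABIO + "hasPublicationYear", "2023"),
    (CONCEPT, SKOS + "prefLabel", "Knowledge graph"),
]

lines = [to_ntriple(*t) for t in triples]

if __name__ == "__main__":
    print("\n".join(lines))
```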
Key Statistics
As of 2023, SemOpenAlex contains:
- 249 million publications.
- 135 million authors.
- 108,000 institutions.
- 1.7 billion citations.
Applications
SemOpenAlex supports the following applications:[1]
- Analytics: Enables large-scale research impact assessments and trend analysis.
- Recommender Systems: Suggests publications, collaborators, and venues with explainability.
- Semantic Publishing: Links publications to datasets and methods, enhancing scientific communication.
- AI Integration: Provides reliable metadata for citation generation and scholarly LLMs.
- Benchmarking: Serves as a resource for testing systems on large-scale, realistic knowledge graphs.
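The recommender use case listed above can be sketched with a simple nearest-neighbour lookup over entity embeddings. The example below ranks items by cosine similarity; the three-dimensional vectors and the `work:` identifiers are toy values invented for illustration, and cosine similarity is a common generic choice here, not necessarily the method used by the SemOpenAlex authors.

```python
import math


def cosine(a, b):
    """Cosine similarity of two equal-length vectors (0.0 if either is zero)."""
    dot = sum(x * y for x, y in zip(a, b))
    norm = math.sqrt(sum(x * x for x in a)) * math.sqrt(sum(y * y for y in b))
    return dot / norm if norm else 0.0


def recommend(target, embeddings, k=2):
    """Return the k entities most similar to `target`, best match first."""
    scores = [
        (other, cosine(embeddings[target], vec))
        for other, vec in embeddings.items()
        if other != target
    ]
    scores.sort(key=lambda pair: pair[1], reverse=True)
    return scores[:k]


# Toy embeddings; real knowledge graph embeddings have far more dimensions.
EMBEDDINGS = {
    "work:A": [1.0, 0.0, 0.0],
    "work:B": [0.9, 0.1, 0.0],
    "work:C": [0.0, 1.0, 0.0],
    "work:D": [0.5, 0.5, 0.0],
}

if __name__ == "__main__":
    for name, score in recommend("work:A", EMBEDDINGS):
        print(f"{name}\t{score:.3f}")
```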
Licensing
The data is licensed under Creative Commons Zero (CC0), enabling unrestricted use. Source code is available on GitHub.
References
- Färber, Michael; Lamprecht, David; Krause, Johan; Aung, Linn; Haase, Peter (2023). "SemOpenAlex: The Scientific Landscape in 26 Billion RDF Triples". Proceedings of the 22nd International Semantic Web Conference (ISWC'23). Athens, Greece. arXiv:2308.03671.
- "ISWC 2023 Awards". 8 November 2023. Retrieved 1 December 2024.