Journal of Geographical Sciences >
Methodology, progress and challenges of geoscience knowledge graph in International Big Science Program of Deep-Time Digital Earth
Zhu Yunqiang (1977-), PhD and Professor, specialized in geographic knowledge graphs and geospatial data sharing. E-mail: zhuyq@igsnrr.ac.cn |
Received date: 2024-08-04
Accepted date: 2025-02-20
Online published: 2025-09-05
Supported by
Strategic Priority Research Program of the Chinese Academy of Sciences(XDB0740000)
National Key Research and Development Program of China(2022YFB3904200)
National Key Research and Development Program of China(2022YFF0711601)
Key Project of Innovation LREIS(PI009)
National Natural Science Foundation of China(42471503)
Deep-time Earth research plays a pivotal role in deciphering the rates, patterns, and mechanisms of Earth’s evolutionary processes throughout geological history, providing essential scientific foundations for climate prediction, natural resource exploration, and sustainable planetary stewardship. To advance Deep-time Earth research in the era of big data and artificial intelligence, the International Union of Geological Sciences initiated the “Deep- time Digital Earth International Big Science Program” (DDE) in 2019. At the core of this ambitious program lies the development of geoscience knowledge graphs, serving as a transformative knowledge infrastructure that enables the integration, sharing, mining, and analysis of heterogeneous geoscience big data. The DDE knowledge graph initiative has made significant strides in three critical dimensions: (1) establishing a unified knowledge structure across geoscience disciplines that ensures consistent representation of geological entities and their interrelationships through standardized ontologies and semantic frameworks; (2) developing a robust and scalable software infrastructure capable of supporting both expert-driven and machine-assisted knowledge engineering for large-scale graph construction and management; (3) implementing a comprehensive three-tiered architecture encompassing basic, discipline-specific, and application-oriented knowledge graphs, spanning approximately 20 geoscience disciplines. Through its open knowledge framework and international collaborative network, this initiative has fostered multinational research collaborations, establishing a robust foundation for next-generation geoscience research while propelling the discipline toward FAIR (Findable, Accessible, Interoperable, Reusable) data practices in deep-time Earth systems research.
ZHU Yunqiang , WANG Qiang , WANG Shu , SUN Kai , WANG Xinbing , LV Hairong , HU Xiumian , ZHANG Jie , WANG Bin , QIU Qinjun , YANG Jie , ZHOU Chenghu . Methodology, progress and challenges of geoscience knowledge graph in International Big Science Program of Deep-Time Digital Earth[J]. Journal of Geographical Sciences, 2025 , 35(5) : 1132 -1156 . DOI: 10.1007/s11442-025-2361-0
Figure 1 Cognition of geoscience knowledge and its representation in graph structures. The nodes labeled as “Concept Node” and “Attribute Node” in the Ontology Level refer to conceptual geoscience knowledge and the respective attribute information associated with them. The nodes labeled as “Instance Node” and “Instance Property” in the Instance Level represent factual geoscience knowledge and their corresponding specific attribute values. |
Figure 2 The geoscience knowledge graph (GKG) construction and sharing application software framework |
Figure 3 Online service of geoscience knowledge graph collaborative editor |
Figure 4 Whole architecture of the geoscience knowledge tree auto-renew and completion system |
Figure 5 Design philosophy of the crowdsourced knowledge editing platform |
Figure 6 Overall architecture of one-stop for geoscience knowledge graph (GeoOpenKG). The Deep-time Digital Earth International Big Science Program geoscience knowledge graph (DDE GKG) comprises three hierarchical levels: (a) BO4KG, (b) GPKGs, and (c) application-specific knowledge graphs, which correspond to the root (as the foundational base), trunk (as the supporting pillar), and crown (as the fruits of applications) of the “DDE GKG Tree,” respectively. Additionally, Geoscience Knowledge Graph Open Directory (GKGD) constructs an efficient index to other open knowledge graphs, and, together with the DDE GKG, forms the GeoOpenKG, a comprehensive platform for the opening and sharing of GKGs. |
Figure 7 Home page of the one-stop for geoscience knowledge graph |
Figure 8 Multimodal Geoscience Semantic Search System of geoscientific literature |
Figure 9 Homepage and portrait report of Academic Portrait and Report Auto-generation System |
Figure 10 Exemplary ontologies within Basic Ontology for Geoscience Knowledge Graph |
Table 1 The number of concepts and relations in the Geoscience Academic Knowledge Graph |
Concept | Count | Relation | Count |
---|---|---|---|
paper | 3,711,981 | is_cited_by | 39,540,518 |
author | 3,924,701 | on_the_topic_of | 9,194,720 |
affiliation | 27,195 | is_written_by | 17,219,176 |
topic | 792,281 | is_published_in | 3,711,981 |
journal | 535 | is_last_known_in | 3,290,176 |
timescale | 1792 | has_illustration | 6,417,139 |
mention | 829,749 | has_table | 1,061,790 |
illustration | 6,417,139 | has_mention | 17,920,198 |
papertable | 1,061,790 | mention_location | 723,180 |
location | 412,108 | has_geohash | 2,161,150 |
geohash | 2,161,150 | mention_timescale | 2,420,151 |
parent_mention | 51,934 | ||
Total | 19,340,421 | Total | 86,492,937 |
Figure 11 Geoscience professional knowledge graphs constructed by Geoscience Knowledge Graph Collaborative Editor |
Figure 12 Illustrative instances of application geoscience knowledge graphs |
Table 2 Comparison of key geoscience knowledge projects |
Geoscience knowledge project | Focus and scope | Initiating organization/individual | Knowledge scale | Construction methodology |
---|---|---|---|---|
DDE GKG | Earth’s geological and evolutionary processes, integrating multiple Earth science disciplines | DDE International Big Science Program | 20 geoscience disciplines, 62,000 solid Earth concepts/ properties; >150 million triples (literature graph) | Hybrid: top-down expert-driven ontology, bottom-up AI-driven instance extraction |
GeoSciML | Data model for geoscience data sharing and exchange | IUGS | 1772 concepts | Expert-driven manual ontology creation |
SWEET | Ontology library describing core concepts in Earth sciences | NASA | 4533 concepts, 359 properties | Expert, crowdsourcing, and domain dictionary- based curation |
GeoNames Ontology | Global geographic name dictionary | Marc Wick (person) | ~12 million geographic entities, ~25 million place names | Expert and crowdsourcing- based curation |
LinkedEarth Ontology | Semantic platform for paleoclimate data integration and archiving | National Science Foundation, United States | 6 sub-ontologies, 148 concepts, 55 relationships | Expert-driven manual ontology creation |
GeoDeepDive | Geoscience literature processing | National Science Foundation, United States | ~13.4 million geoscience documents | Machine learning- based literature mining |
Figure 13 Overall technical scheme of Deep-time Digital Earth International Big Science Program knowledge graph construction |
Figure 14 The steps for constructing ontologies of the geoscience knowledge graph |
[1] |
|
[2] |
|
[3] |
|
[4] |
DDE, 2021a. Geoscience Knowledge Graph Collaborative Editor. Retrieved from https://editor.deep-time.org/#/.
|
[5] |
DDE, 2021b. Geoscience Knowledge Tree Auto-Renew and Completion System. Retrieved from https://labeling.acemap.cn/#/.
|
[6] |
DDE, 2021c. Knowledge Hub. Retrieved from https://deep-time.org/#/home-knowledge/hub.
|
[7] |
DDE, 2021d. Multimodal Geoscience Academic Knowledge Graph. Retrieved from https://gakg.deep-time.org/#/svirql.
|
[8] |
DDE, 2022. One-stop for GeoScience Knowledge Graph. Retrieved from https://geoopenkg.deep-time.org/.
|
[9] |
DDE, 2023a. DDE WIKI. Retrieved from https://wiki.deep-time.org/wiki/Main_Page.
|
[10] |
DDE, 2023b. Deep Literature. Retrieved from https://ddescholar.acemap.info/.
|
[11] |
|
[12] |
|
[13] |
|
[14] |
IUGS, 2019. Deep-Time Digital Earth (DDE). Retrieved from https://www.iugs.org/dde.
|
[15] |
|
[16] |
|
[17] |
|
[18] |
|
[19] |
|
[20] |
|
[21] |
|
[22] |
|
[23] |
|
[24] |
|
[25] |
|
[26] |
|
[27] |
|
[28] |
|
[29] |
|
[30] |
|
[31] |
|
[32] |
|
[33] |
|
[34] |
|
[35] |
|
[36] |
|
[37] |
|
[38] |
|
[39] |
|
[40] |
|
[41] |
|
[42] |
|
[43] |
|
[44] |
|
[45] |
|
[46] |
|
[47] |
|
[48] |
|
[49] |
|
[50] |
|
[51] |
|
[52] |
|
[53] |
|
[54] |
|
[55] |
|
/
〈 |
|
〉 |