Global Journal of Computer Science and Technology, D: Neural & Artificial Intelligence, Volume 23 Issue 2
relationships or between a relationship and an entity. This is possible by considering a relationship itself as an entity. The triple of an entity, a relation and an entity is called a fact. They represent respectively a subject, a predicate, and an object. Example: yago: Zafy_Albert rdf: type schema: Person . The two entities are called the arguments of the fact. In YAGO, each fact is given a fact identifier, that is one of its strengths. To maintain the semantic integrity of the data, YAGO uses the SHACL 3 Standard which makes it possible toexpress semantic integrity constraints. b) Mathematical Definition of YAGO A YAGO ontology over a finite set of common entities C, a finite set of relation names R and a finite set of fact identifiers I is a function with the following definition: y : → ( ) × × ( ) For facts that require more than two arguments, it is assumed that for each n-ary relation, a primary pair of its arguments can be identified. The primary pair can be represented as a binary fact with a fact identifier: #1: AlbertEinstein HASWONPRIZE NobelPrize. All other arguments can be represented as relations that hold between the primary pair and the other argument: #2: #1 TIME 1921. Now, it’s time to see how to exploit YAGO for Tontolo Malagasy. c) Exploiting YAGO for Tontolo Malagasy In the mass of YAGO data, there is very little portion of information on Malagasy culture. After a brief test, we noticed that out of 1,048,576 facts, only 80 mention Madagascar as a subject or as an object. In addition, there are specific Malagasy concepts and relationships that are totally unknown to YAGO. And finally, and obviously, YAGO does not understand the Malagasy language. However, thanks to the flexibility of YAGO, including all these specificities that it currently lacks is not an impossible mission. There are three things we can do to enable YAGO extension on Tontolo Malagasy : 1. Data integration 2. Implicit translation 3. Implicit specification . Data integration The challenge is to know how to recognize everything that is essential to our project and also to know at what level we must act (adapt, adopt, add, personalize, withdraw). Actually we are more interested in the taxonomy and in the common entities than in the named entities or in the facts because we would like to contribute our own named entities and our own facts. However, it is quite tricky because even if Madagascar is an island, it is not isolated from the rest of the world, so we still have to leave room for everything related to the outside world. Let us for now mainly focus on persons, places and organizations. In the YAGO taxonomy, the class person has 1569 subclasses that correspond to 1569 different professions. They don't all interest us right away. We will start with a small number to be able to accommodate the historical and cultural characters of our knowledge base. Here are some examples of professions that we could integrate: Ambassador, Contestant, Entrepreneur, Evangelist, Foreigner, Governer, Head_of_State, Historian, Humanitarian, Leader, Sovereign… Each person may be defined through YAGO-predicates like <hasChild>, <hasGivenName>, <isCitizenOf>, <hasFamilyName>, <wasBornIn>, <isMarriedTo>, <worksAt>, <isLeaderOf>, <diedIn>, <hasGender>, <holdsPoliticalPosition>, <diedOnDate>, <livesIn>, <wasBornOnDate>, <graduatedFrom> … Concerning places, YAGO integrates data from GeoNames for places. GeoNames is a geographical user- editable database that covers all countries and contains over eleven million placenames that are available for download free of charge. The Tontolo Malagasy project could extract directly from GeoNames all the geographical entities that concern Madagascar, if they lack in YAGO. In the YAGO taxonomy, the class organization has 278 subclasses that correspond to 278 different types of organization. As with persons, they don’t all interest us. Here are some examples of organization types that we will certainly integrate: academy, church_school, diplomatic_mission, deputation, educational institute, family, industry, institution, local . Finally, here is an example of how to describe the President Zafy Albert in the language of YAGO. Global Journal of Computer Science and Technology Volume XXIII Issue II Version I 5 ( )D Year 2023 © 2023 Global Journals i Guideline for Including Unperceivable Knowledge in a Universal Ontology Experimentation Field: Ontology Malagasy
RkJQdWJsaXNoZXIy NTg4NDg=