On scalable RDFS reasoning using a hybrid approach

On scalable RDFS reasoning using a hybrid approach

Reasoning is a vital ability for semantic web applications since they aim to understand and interpret the data on the World Wide Web. However, reasoning of large data sets is one of the challenges facing semantic web applications. In this paper, we present new approaches for scalable Resource Description Framework Schema (RDFS) reasoning. Our RDFS specific term-based partitioning algorithm determines required schema elements for each data partition while eliminating the data partitions that will not produce any inferences. With the two-level partitioning approach, we are able to carry out reasoning with limited resources. In our hybrid approach, we integrate two previously mentioned methods to benefit from the advantages of both. In the experimental tests we achieve linear speedups for reasoning times with the proposed hybrid approach. These algorithms and methods presented in the paper enable RDFS-level reasoning of large data sets with limited resources, and they together build up a scalable distributed reasoning approach.

___

  • [1] Berners-Lee T, Hendler J, Lassila O. The semantic web. Sci Am 2001; 284: 34–43.
  • [2] Antoniou G, Harmelen VF. A Semantic Web Primer. 2nd ed. Cambridge, MA, USA: MIT Press, 2008.
  • [3] Bizer C, Mika P. The semantic web challenge. J Web Semant 2009; 8: 341–374.
  • [4] Li P, Zeng Y, Kotoulas S, Urbani J, Zhong N. The quest for parallel reasoning on the semantic web. In: 5th International Conference on Active Media Technology; 22–24 October 2009; Beijing, China.
  • [5] Fang Q, Zhao Y, Yang G, Zheng W. Scalable distributed ontology reasoning using DHT-based partitioning. In: 3rd Asian Semantic Web Conference; 2008; Bangkok, Thailand. Berlin, Germany: Springer-Verlag. pp. 91–105.
  • [6] Kaoudi Z, Miliaraki I, Koubarakis M. RDFS reasoning and query answering on top of DHTs. In: 7th International Semantic Web Conference; 2008; Karlsruhe, Germany. Berlin, Germany: Springer-Verlag. pp. 499–516.
  • [7] Soma R, Prasanna VK. A data partitioning approach for parallelizing rule based inferencing for materialized owl knowledge bases. In: 21st ISCA International Conference on Parallel and Distributed Computing and Communication Systems; 2008; New Orleans, LA, USA. pp. 19–25.
  • [8] Soma R, Prasanna VK. Parallel inferencing for OWL knowledge bases. In: International Conference on Parallel Processing; 2008. pp. 75–82.
  • [9] Urbani J, Kotoulas S, Oren E, Harmelen FV. Scalable distributed reasoning using MapReduce. In: 8th International Semantic Web Conference; 2009. pp. 634–649.
  • [10] Weaver J, Hendler J. Parallel materialization of the finite RDFS closure for hundreds of millions of triples. In: 8th International Semantic Web Conference; 2009. pp. 682–697.
  • [11] Kotoulas S, Oren E, Harmelen FV. Mind the data skew: Distributed inferencing by speeddating in elastic regions. In: 19th International World Wide Web Conference; 2010. pp. 531–540.
  • [12] Oren E, Kotoulas S, Anadiotis G, Siebes R, Teije AT, Harmelen FV. Marvin: Distributed reasoning over large-scale semantic web data. J Web Sem 2009; 7: 305–316.
  • [13] Oren E, Kotoulas S, Anadiotis G, Siebes R, Teije AT, Harmelen FV. MARVIN: A platform for large-scale analysis of Semantic Web data. In: WebScience 2009: Society On-Line.
  • [14] Ding L, Finin T. Characterizing the semantic web on the web. In: International Semantic Web Conference; 2006. pp. 242–257.
  • [15] Horst HJ. Completeness, decidability and complexity of entailment for RDF schema and a semantic extension involving the OWL vocabulary. J Web Semant 2005; 3: 79–115.
  • [16] Carroll JJ, Dickinson I, Dollin C, Reynolds D, Seaborne A, Wilkinson K. Jena: implementing the semantic web recommendations. In: 13th International World Wide Web Conference; 2004. New York, NY, USA: ACM Press.pp. 74–83.
  • [17] Aleman-Meza B, Hakimpour F, Arpinar IB, Sheth AP. SwetoDblp ontology of computer science publications. J Web Semant 2007; 5: 151–155.
  • [18] Ley M. The DBLP computer science bibliography: evolution, research issues, perspectives. Lect Notes Comput Sc 2002; 2476: 481–486.
  • [19] Kulahcioglu T, Bulut H. Overcoming limitations of term-based partitioning for distributed RDFS reasoning. In: 5th Workshop on Semantic Web Information Management; 2013. New York, NY, USA: ACM. pp. 7:1–7:4.
Turkish Journal of Electrical Engineering and Computer Sciences-Cover
  • ISSN: 1300-0632
  • Yayın Aralığı: Yılda 6 Sayı
  • Yayıncı: TÜBİTAK