CIAA2003 START ConferenceManager    

XML Schema Containment Checking based on Semi-implicit Techniques

Akihiko Tozawa and Masami Hagiya

Presented at Eighth International Conference on Implementation and Application of Automata (CIAA 2003), July 16-18, 2003 Santa Barbara, CA, USA


Abstract

XML schemas are computer languages defining grammars of XML (Extensible Markup Languages). Containment checking for XML schemas has many applications, and thus is important. Since XML schemas are related to the class of tree regular languages, their containment checking is reduced to the language containment problem for non-deterministic tree automata (NTAs). However, an NTA for a practical XML schema has 10^2-10^3 states for which the textbook algorithm based on naive determinization is expensive. Thus we in this paper consider techniques based on BDDs (binary decision diagrams). We used semi-implicit encoding which encodes a set of subsets of states as a BDD, rather than encoding a set of states by it. The experiment on several real-world XML schemas proves that our containment checker can answer problems that cannot be solved by previously known algorithms.