Grammar-based Compression for Unrooted Trees and Subclass of Plane Graphs Using Integer Linear Programming
全文
(2) Vol.2016-MPS-108 No.11 Vol.2016-BIO-46 No.11 2016/7/4. IPSJ SIG Technical Report. C. B. A. A. C A. B. C. B. Fig. 3 Additional production rules for a planar graph.. n. N. N. n N H. N. c. n. n. c n. n m. m. Ei0 ,S 0 , j0 ,T 0 ∪ Ei00 ,S 00 , j00 ,T 00 = Ei,S , j,T , Ei0 ,S 0 , j0 ,T 0 ∩ Ei00 ,S 00 , j00 ,T 00 = ∅, Ei0 ,S 0 , j0 ,T 0 , ∅, and Ei00 ,S 00 , j00 ,T 00 , ∅. I(G) contains a large number of elements for unordered trees and planar graphs, and the number of nonterminal symbols becomes also large. We consider an ordered version of these grammars, and represent S ∈ E(i) by two nodes, starting and ending nodes in clockwise direction. Then, a nonterminal symbol is represented by Gi,s,e, j,t, f for s, e ∈ E(i) and t, f ∈ E( j) instead of Gi,S , j,T , and we propose the following integer linear programming formulation for finding the minimum grammar representing only a given plane graph G. min. S. A. B. B. F. A. C. D. H. I. H. I. X. pu. u∈S(G). subject to x,∅,,∅ = 1, xi,s,e, j,t, f = 1 for all (i, s, X e, j, t, f ) ∈ I(G) s. t. |Ei,s,e, j,t, f | = 1, xi,s,e, j,t, f ≤ yi0 ,s0 ,e0 , j0 ,t0 , f 0 ,i00 ,s00 ,e00 , j00 ,t00 , f 00 (i0 ,s0 ,e0 , j0 ,t0 , f 0 ,i00 ,s00 ,e00 , j00 ,t00 , f 00 )∈C(Gi,s,e, j,t, f ). for all (i, s, e, j, t, f ) ∈ I(G) s. t. |Ei,s,e, j,t, f | ≥ 2, 1 y ≤ (xi0 ,s0 ,e0 , j0 ,t0 , f 0 + xi00 ,s00 ,e00 , j00 ,t00 , f 00 ), 2 su ≤ pu < 1 + su for all u ∈ S(G), X 1 su = xi,s,e, j,t, f . |E| {(i,s,e, j,t, f )∈I(G)|Gi,s,e, j,t, f is isomorphic to u} i0 ,s0 ,e0 , j0 ,t0 , f 0 ,i00 ,s00 ,e00 , j00 ,t00 , f 00. C. D. E. D. c. E. F. F. G. n 3.. F. G. G. I. m. Fig. 4 Example of a plane graph and its grammar. The edge-labeled graph (the upper right) is transformed from the purine chemical compound (the upper left). Dashed circles denote nitrogen atoms.. tices and a set E of labeled edges. Consider the following cutting procedure from G to G0 (V 0 , E 0 ) at a vertex i with a set S of vertices, where S ⊂ E(i) = { j ∈ V|(i, j) ∈ E}. (i) add a new vertex i0 to G (i.e., V 0 = V ∪ {i0 }). (ii) replace edges connected to i which endpoints are in S with edges connected to i0 (i.e., E 0 = E ∪ {(i0 , j)| j ∈ S } − {(i, j)| j ∈ S }). Then, if G0 is not connected and consists of two connected components, we say that G is separable with i and S . In a similar way, we consider the cutting procedure from G at two vertices i and j with two sets S and T , where S ⊂ E(i) and T ⊂ E( j). Gi,S , j,T (Vi,S , j,T , Ei,S , j,T ) is defined as a connected subgraph of G obtained by the cutting procedure, which includes non-empty sets S , T , does not include E(i) − S and E( j) − T , where i, j ∈ V ∪ {}, S ⊂ E(i), T ⊂ E( j), means nothing. G is represented by G,∅,,∅ . Gi,S , j,T is distinguished from G j,T,i,S in almost all cases. Suppose that S(Gi,S , j,T ) and I(Gi,S , j,T ) are sets of all subgraphs Gi0 ,S 0 , j0 ,T 0 and its indices (i0 , S 0 , j0 , T 0 ), respectively, of Gi,S , j,T by the cutting procedure. Consider the case that Gi,S , j,T is decomposed into Gi0 ,S 0 , j0 ,T 0 and Gi00 ,S 00 , j00 ,T 00 . Let C(Gi,S , j,T ) be a set of all index combinations (i0 , S 0 , j0 , T 0 , i00 , S 00 , j00 , T 00 ) that Vi0 ,S 0 , j0 ,T 0 ∪ Vi00 ,S 00 , j00 ,T 00 = Vi,S , j,T , Vi0 ,S 0 , j0 ,T 0 ∩ Vi00 ,S 00 , j00 ,T 00 = {i, j}, ⓒ 2016 Information Processing Society of Japan. Discussion. In this study, we proposed the definition of grammars for unrooted trees and some subclass of planar graphs, and integer linear programming-based method for finding the minimum grammar representing only a given plane graph. We can prove that the proposed grammar generates only a planar graph. It, however, is difficult to concretely show the subclass of planar graphs that the grammar can generate, and to determine whether or not our method can be applied to a given plane graph. As future work, we would like to uncover the subclass that the grammar can generate, apply the proposed method to actual plane graphs, and extend the grammar to more complicated structures.. Acknowledgements This work was partially supported by Grant-in-Aid #16K00392 from MEXT, Japan. References [1]. [2]. Zhao, Y., Hayashida, M., Cao, Y., Hwang, J. and Akutsu, T.: Grammarbased compression approach to extraction of common rules among multiple trees of glycans and RNAs, BMC Bioinformatics, Vol. 16, p. 128 (2015). Akutsu, T.: A bisection algorithm for grammar-based compression of ordered trees, Information Processing Letters, Vol. 110, pp. 815–820 (2010).. 2.
(3)
図
関連したドキュメント
Making use of Linear operator theory, we define a new subclass of uniformly convex functions and a corresponding subclass of starlike functions with nega- tive coefficientsG. The
Theorem 1. Tarnanen uses the conjugacy scheme of the group S n in order to obtain new upper bounds for the size of a permutation code. A distance that is both left- and right-
Motivated by the new perturbation results of closed linear generalized inverses [12], in this paper, we initiate the study of the following problems for bounded homogeneous
As just mentioned, the method used for recognizing circulant graphs is based on the notions of coherent configurations and Schur rings generated by graphs and on the in-
Chapoton pointed out that the operads governing the varieties of Leibniz algebras and of di-algebras in the sense of [22] may be presented as Manin white products of the operad
Upon deleting the tripleton part, and recalling our earlier analysis of the tripartite pairing (Theorem 3.1), we see that each partition τ contributes to σ a sign which involves
3 by two simple examples: we first give another solution of (2) obtained when m = 2, and then a generating function proof of MacMahon’s formula for the number of standard tableaux of
Definition 18 A total labeling of a finite leaf labeled tree with leaves labeled from a totally ordered set such as N ∪ {∞} is a maxmin labeling if each internal vertex, v, has label