Functional Dependency Extraction with Sequential Indexed Search Trees

Main Article Content

B. Tusor
A.R. Vàrkonyi-Kòczy
S. Gubo

Abstract

Functional dependency analysis is an important field of data science, where the goal is to determine the relationships between different data attributes and attribute sets in a given data set. This can lead to gaining valuable information about the data that is often not evident through surface-level analysis. In previous work, the authors proposed a functional dependency extraction method called Sequential Indexing Tables, which is a specialized variant of the Sequential Fuzzy Indexing Tables (SFITs) classifiers. SFITs combined lookup table classifiers with fuzzy logic to implement a very fast yet flexible classification. A special feature of the SFIT classifier is that its structure indicates the functional dependencies between the data attributes that are present in the training data set. However, the main disadvantage of SFITs is that they require a significant part of the problem space to be stored in the computer memory, scaling exponentially with the number of attributes. To solve this issue, a new classifier called Sequential Fuzzy Indexed Search Trees (SFISTs) has been proposed by the authors, which builds on the same idea as SFITs but uses a more compact structure while providing a slightly better classification accuracy. In this paper, the functional dependency detection and extraction method is presented, which is a specialized version of the SFISTs classifier that uses the same base idea as its predecessor but with a much smaller spatial complexity. 


 

Article Details

How to Cite
[1]
B. Tusor, A. Vàrkonyi-Kòczy, and S. Gubo, “Functional Dependency Extraction with Sequential Indexed Search Trees”, Acta Phys. Pol. A, vol. 146, no. 4, p. 515, Nov. 2024, doi: 10.12693/APhysPolA.146.515.
Section
Special segment

References

E.F. Codd, Commun. ACM 13, 377 (1970)

G.N. Paulley, Ph.D. Thesis, University of Waterloo, 2000

P. Bohannon, W. Fan, F. Geerts, in: Proc. of the Int. Conf. on Data Engineering (ICDE), 2007, p. 746

M. Le Guilly, J.M. Petit, V.M. Scuturici, in: Transactions on Large-Scale Data-and Knowledge-Centered Systems XLIV, Eds. A. Hameurlain, A.M. Tjoa, P. Lamarre, K. Zeitouni, Springer, Berlin 2020 p. 132

H. Yao, H.J. Hamilton, Data Min. Knowl. Discov. 16, 197 (2008)

J. Liu, J. Li, C. Liu, Y. Chen, IEEE Trans. Knowl. Data Eng. 24, 251 (2012)

A.R. Vàrkonyi-Kòczy, B. Tusor, J.T. Tòth, in: Recent Global Research and Education: Technological Challenges. Advances in Intelligent Systems and Computing, Vol. 519, Eds. R. Jabłoński, R. Szewczyk, Springer, Cham 2017, p. 403

B. Tusor, J.T. Tòth, A.R. Vàrkonyi-Kòczy, in: IEEE 23rd Int. Conf. on Intelligent Engineering Systems, IEEE, 2019, p. 307

B. Tusor, J.T. Tòth, A.R. Vàrkonyi-Kòczy, Acta Polytech. Hung. 16, 65 (2019)

B. Tusor, O. Takàč, Š. Gubo, A.R. Vàrkonyi-Kòczy, in: Recent Advances in Technology Research and Education, Lecture Notes in Networks and Systems (Inter-Academia 2023), Vol. 939, Eds. Y. Ono, J. Kondoh, Springer, Cham 2024, p. 294

D. Dua, C. Graff, The UC Irvine Machine Learning Repository, 2024