The Taub Faculty of Computer Science Events and Talks
Wednesday, 07.06.2023, 11:30
Many datasets in real life are complex and dynamic, that is, their key densities are varied over the whole key space and their key distributions change over time. It is challenging for an index structure to efficiently support all key operations for data management, in particular, search, insert, and scan, for such dynamic datasets. In this talk, I will present DyTIS (Dynamic dataset Targeted Index Structure), an index that targets dynamic datasets. DyTIS, though based on the structure of Extendible hashing, leverages the CDF of the key distribution of a dataset, and learns and adjusts its structure as the dataset grows. The key novelty behind DyTIS is to group keys by the natural key order and maintain keys in sorted order in each bucket to support scan operations within a hash index. We also define what we refer to as a dynamic dataset and propose a means to quantify its dynamic characteristics. Our experimental results show that DyTIS provides higher performance than the state-of-the-art learned index for the dynamic datasets considered.
Sam H.(Hyuk) Noh received the BE degree in computer engineering from the Seoul National University, Seoul, Korea, in 1986, and the PhD degree from the Department of Computer Science, University of Maryland, College Park, MD, in 1993. He held a visiting faculty position at the George Washington University, Washington, DC, from 1993 to 1994 before joining Hongik University, in Seoul, Korea, where he was a professor in the School of Computer and Information Engineering until the Spring of 2015. During this period, he served as the Chair of the Department of Computer Engineering as well as the Head of the School from September of 2013 through February of 2015. From August 2001 to August 2002, he was also a visiting associate professor with the University of Maryland Institute of Advanced Computer Studies (UMIACS), College Park, MD. Starting from the Fall of 2015 he joined UNIST (Ulsan National Institute of Science and Technology), a young science and tech focused national university, where he was a Professor at the Department of Computer Science and Engineering and served as the inaugural Dean of the Graduate School of Artificial Intelligence in the College of Information and Biotechnology from August 2020 through March 2023. He also served as the Dean of the School of Electrical and Computer Engineering from January of 2016 through June of 2018. As of January 2023, he is a Professor at the Computer Science Department at Virginia Tech. He has served/serves as General Chair, Program Chair, and Program Committee Member on a number of technical conferences and workshops. He also served on the Steering Committee of LCTES from 2016 through 2020. He is currently the Chair of the Steering Committee for ACM HotStorage and a Steering Committee member of USENIX FAST and IEEE NVMSA. He also served as Editor-in-Chief of the ACM Transactions on Storage from 2016 through 2022. His research interests include system software issues pertaining to computer systems in general and storage systems in particular, with a focus on the use of new memory technologies such as flash memory and persistent memory. He is a Fellow of the ACM and IEEE and a member of USENIX and KIISE (Korean Institute of Information Scientists and Engineers).