This study analyzes the evolution of data science vocabulary using 16,018 abstracts containing "data science" over 13 years. It identifies new vocabulary introduction and its integration into scientific literature using techniques like EDA, LSA, LDA, and N-grams. The research compares overall scientific publications with those specific to Saudi Arabia, identifying representative articles based on vocabulary usage. Why it matters: The work provides insights into the development of data science terminology and its specific adoption within the Saudi Arabian research landscape.
KAUST held a research conference on Computational and Statistical Interface to Big Data from March 19-21. The conference covered topics like data representation, visualization, parallel algorithms, and large-scale machine learning. Participants came from institutions including the American University of Sharjah, Aalborg University, and others to exchange ideas. Why it matters: The conference highlights KAUST's focus on promoting big data research and collaboration to address challenges and opportunities in various scientific fields within the Kingdom and globally.
This paper focuses on analyzing surveys of women entrepreneurs in the UAE using machine learning techniques. The goal is to extract relevant insights from the data to understand the current landscape and predict future trends. The study aims to support better business decisions related to women in entrepreneurship.
KAUST hosted a regional Women in Data Science (WiDS) conference, part of a global event held at over 100 regional institutions led by Stanford University. The KAUST event featured exclusively female speakers and aimed to highlight data science research and applications. KAUST is launching a 'Women in Data Sciences and Technology' initiative to support women's education and careers in the field. Why it matters: This initiative can help address the underrepresentation of women in data science in Saudi Arabia and the broader region.
KAUST Professor Marc Genton and his former postdoc Stefano Castruccio jointly won the 2017 Wilcoxon Award for their paper in Technometrics. Their paper, "Compressing an ensemble with statistical models: An algorithm for global 3D spatio-temporal temperature," details a data-compression scheme for climate simulations. The method reduces data-storage requirements and accelerates climate research capacity. Why it matters: This award highlights KAUST's contribution to statistical methods for climate modeling and big data analysis, particularly relevant for studying renewable energy resources in Saudi Arabia.
Emilio Porcu from Khalifa University presented on temporally evolving generalized networks, where graphs evolve over time with changing topologies. The presentation addressed challenges in building semi-metrics and isometric embeddings for these networks. The research uses kernel specification and network-based metrics and is illustrated using a traffic accident dataset. Why it matters: This work advances the application of kernel methods to dynamic graph structures, relevant for modeling evolving relationships in various domains.
KAUST Ph.D. student Zhuo Qu and fellow students from the Statistics Program launched the first American Statistical Association (ASA) student chapter outside of the U.S. in October 2019. The chapter aims to encourage and provide opportunities for KAUST students interested in statistics to connect with statisticians worldwide. In 2020, the chapter plans to organize seminars and connect students interested in statistics and data mining. Why it matters: This initiative highlights KAUST's commitment to fostering a global network of statisticians and promoting data analysis skills among its students, enhancing its role as a hub for international collaboration in STEM fields.
KAUST Assistant Professor of Statistics Ying Sun won the 2016 Abdel El-Shaarawi Young Researcher (AEYR) Award in June. The award recognizes young researchers for contributions to statistics and related fields. Why it matters: This highlights KAUST's commitment to attracting and recognizing talented researchers in data science and related fields.