The CityU’s Data Science Day is organized by the School of Data Science (SDSC) and the Hong Kong Institute for Data Science (HKIDS).
The Day will begin by President Way Kuo’s opening address. World-renowned Artificial Intelligence scholar and entrepreneur, Prof Wen Gao, will deliver the keynote speech through online platform. The Day will feature CityU's HK Tech 300 Venture and panel discussion on leveraging the venture to catalyze the impact in AI and Data Science will be held in the late morning. Technical sessions to showcase HKIDS project presentations by faculty members across CityU and project sharing by SDSC PhD Students are in the afternoon session.
The Data Science Day also celebrates the occasion of SDSC’s and HKIDS’s third-anniversary.
Date: 10 August 2021 (Tuesday)
Time: 9:00 am – 4:45 pm (Registration starts at 8:30am)
Format: Online Participation via Zoom
Highlights:
Opening Remarks by the President
SDSC and HKIDS Status Updates
Keynote Speech by Dr Wen Gao
Introduction of CityU's HK Tech 300 Venture
Panel Discussion by Deans of Colleges and Schools
HKIDS PI's Research Review
Project Sharings by PhD students
Dean and Chair Professor, School of Data Science;
Director, Hong Kong Institute for Data Science, CityU
Prof Wen GAO
Academician of Chinese Academy of Engineering; Director of Peng Cheng Laboratory, Shenzhen, China;
Boya Chair Professor, and Dean at the School of Electronics Engineering and Computer Science, Peking University
Topic: Peng Cheng Cloud Brain Open Source Ecosystem
Abstract:
Peng Cheng Laboratory (PCL) is a new type of scientific research institution in the field of network communications in China. PCL focuses on the strategic, forward-looking, original scientific research and core technology development in the related fields. Peng Cheng Cloud Brain II is the first independent E-level artificial intelligence supercomputing platform in China. It is open source with ultra-high computing density, super large-scale computing power and ultra-fast training speed. It is going to create ecological environment to support future scientific research and empower industrial development
Dean and Chair Professor, School of Data Science;
Director, Hong Kong Institute for Data Science, CityU
Associate Vice-President (Strategic Research), CityU
Dean, College of Engineering, CityU
Dean, School of Creative Media CityU
Director, Laboratory for AI-Powered Financial Technologies Limited
Chair Professor, Department of Media and Communication;
Chair Professor, School of Data Science, CityU
Chair Professor, Department of Electrical Engineering, CityU
Dean, College of Liberal Arts and Social Sciences;
Chan Hon Pun Professor of Behavioural and Policy Sciences;
Chair Professor, Department of Public Policy, CityU
Associate Professor, Department of Electrical Engineering, CityU
Associate Professor, Department of Computer Science, CityU
Associate Professor, Department of Linguistics and Translation, CityU
Presented by:
Prof Jonathan ZHU
Chair Professor, Department of Media and Communication;
Chair Professor, School of Data Science, CityU
Project Introduction:
The ongoing COVID-19 pandemic has once again demonstrated the complexity, vulnerability, and unpredictability of the contemporary international relations. Of various research traditions on global studies, network analysis has been an increasingly popular approach to help uncover hidden, indirect, and multilateral dynamics underlying the global structure. In the current study, we apply ego-network analysis, a set of useful but less known tools, to explore the ongoing reconfiguration of the global politico-economic ecosystems based on crossnational flow of goods, capital and information. While our findings are preliminary, the ego-network approach appears to be able to offer a variety of unexpected insights with important policy implications.
Presented by:
Prof Hong YAN
Chair Professor, Department of Electrical Engineering, CityU
Project Introduction:
This project started in April this year. We aim to investigate big data processing techniques for performance analysis, prediction and control of smart factories. We will study effective methods for manufacturing data filtering and imputation to improve the data quality. Statistical models, machine learning algorithms and optimization techniques will be proposed for data classification and analysis. Computer software will be developed for automated and intelligent control of smart factories.
Presented by:
Dr Yuner ZHU
Postdoc, Department of Public Policy, CityU
Project Introduction:
Digital communication technologies have played a critical role in modern-day protests around the globe in the last decade. Labeled as “crowd-enabled connective actions,” the mobilizing structure of networked social movements may differ fundamentally from professional social movements led by movement organizations or political parties. These networked social movements can resolve free-rider problems of collective action by providing information networks and frame alignment to its participants, which undermines the need for formal leadership.
The Hong Kong protests have operated across diverse traditional media and social media platforms (a Reddit-like forum, Telegram channels, Instagram graphics, and Facebook pages) in a cosmopolitan city with an extremely high level of social media and smartphone penetration rate. This setting is ideal for big data analysis. In this presentation, we provided a preliminary topic and network analysis of 20 million comments over 44 weeks from 2019 to 2020 on LIHKG.com. We analyze all posts with those in the public affairs forum to understand the relationship between different topics and drill down to unpack post volumes, trend, pattern, centrality rank, degree distribution, and core-periphery comments. We identify that the pattern of user activity is highly correlated to offline events. We then reveal how specific topics and social networks form frame alignment. We also identified a number of hyper-active users who contributed frequently to the discussions related to the protests and found that over time a few dozen user accounts gained in-group and out-group recognition in different communities. In other words, while there were no authoritative individuals or organizations in mobilizing and framing the protests, a "networked and leaderful structure" helped distribute the role of information dissemination and frame coordination. In conclusion, we outline the value of big data analysis of social phenomena and point to the challenges it presents for social science research.
Presented by:
Dr Ka Chun WONG
Associate Professor, Department of Computer Science, CityU
Project Introduction:
The early detection of cancers has the potential to save many lives. A recent attempt has been demonstrated successful. However, we note several critical limitations. Given the central importance and broad impact of early cancer detection, we aspire to address those limitations. We explore different supervised learning approaches for multiple cancer type detection and observe significant improvements; for instance, one of our approaches (i.e., CancerA1DE) can double the existing sensitivity from 38% to 77% for the earliest cancer detection (i.e., Stage I) at the 99% specificity level. For Stage II, it can even reach up to about 90% across multiple cancer types. In addition, CancerA1DE can also double the existing sensitivity from 30% to 70% for detecting breast cancers at the 99% specificity level. Data and model analysis are conducted to reveal the underlying reasons. A website is built at http://cancer.cs.cityu.edu.hk/
Presented by:
Dr Yanni SUN
Associate Professor, Department of Electrical Engineering, CityU
Project Introduction:
Microbial communities, which contain microbes sharing the same living space, ubiquitously exist anywhere from various human body sites to different environmental niches. Elucidating the composition and functional profiles of the microbiome (i.e. the ensemble of all the microbes in a niche) is essential in many fields including pathogen discovery, ecology, epidemiology etc. Applying modern sequencing technologies to sequence all the genetic materials in various samples has become the most popular method for studying microbial communities. Massive amount of sequencing data has been accumulated, posing significant computational challenges for converting the data into knowledge. In this short talk, we will share the ongoing researches about this topic at CityU.
Presented by:
Dr John LEE
Associate Professor, Department of Linguistics and Translation, CityU
Project Introduction:
Text readability assessment aims to analyze the difficulty of a document, and predict the school grade for which the document is most suitable. Classic approaches rely largely on surface cues, such as word frequency and sentence length. Syntactic and semantic complexity, however, also have significant impact on text difficulty. In this talk, we present a level-annotated corpus of Chinese textbook materials, and report preliminary results of a readability assessment system that takes not only lexical, but also syntactic and semantic complexity into account.
8:30 am - 9:00 am
9:00 am - 9:15 am
Presented by Prof Way KUO
President and University Distinguished Professor
9:15 am - 9:35 am
Presented by Prof S. Joe QIN
Dean and Chair Professor, School of Data Science;
Director, Hong Kong Institute for Data Science, CityU
9:35 am - 10:35 am
Topic: Peng Cheng Cloud Brain Open Source Ecosystem
Presented by Prof Wen GAO
Academician of Chinese Academy of Engineering;
Director of Peng Cheng Laboratory, Shenzhen, China;
Boya Chair Professor, and Dean at the School of Electronics Engineering and Computer Science, Peking University
10:35 am - 11:05 am
Presented by Prof Michael YANG
Vice President (Research and Technology), CityU
11:05 am - 11:10 am
11:10 am - 12:30 pm
Facilitator:
Prof S. Joe QIN
Dean and Chair Professor, School of Data Science;
Director, Hong Kong Institute for Data Science, CityU
-----------------------------------------------------
Panel members:
Prof Michael TSE
Associate Vice-President (Strategic Research), CityU
Prof Tei-Wei KUO
Dean, College of Engineering, CityU
Prof Richard ALLEN
Dean, School of Creative Media, CityU
Prof Houmin YAN
Director, Laboratory for AI-Powered Financial Technologies Limited
12:30 pm - 2:00 pm
2:00 pm - 4:00 pm
Facilitated by:
Prof Minghua CHEN
Professor, School of Data Science;
Assistant Director, Hong Kong Institute for Data Science, CityU
Prof Ding-xuan ZHOU
Associate Dean and Chair Professor, School of Data Science, CityU
-----------------------------------------------------
Presented by
Prof Jonathan ZHU - Using Network Science to Evaluate and Enhance Hong Kong’s Bridging Role in One Belt One Road (OBOR)
Prof Hong YAN - Big-data-driven Performance Analysis, Prediction and Control of Smart Factories
Dr Yuner ZHU - Unstructured Data, Structured Analysis Using Digital Trace Data to Advance the Understanding of Digital Activism
(Principal Investigator: Prof Richard WALKER)
Dr Ka Chun WONG - Genomic Data Search and Analytics with Applications to Colorectal Cancer Subtype Classification
Dr Yanni SUN - Towards More Accurate Virus Classification Using Graph Convolutional Network
Dr John LEE - Towards Linguistically-motivated Text Readability Assessment for Chinese Learning in Hong Kong
4:00 pm - 4:40 pm
Chaired by Prof Junhui WANG
Professor, School of Data Science
Project:
1. The named entity recognition for theory detection in social research paper - by Miss Siyi ZHOU
(Supervisor: Prof Jonathan ZHU)
2. Constrained Bayesian Optimization of a Time -Consuming Simulator Under Noise Input Variations - by Mr Yidong WANG
(Supervisor: Dr Matthias TAN)
3. AI method for anti cancer drug identification - by Mr Jiannan YANG
(Supervisor: Dr Qingpeng ZHANG)
4. AI model of human mobility in Hong Kong - by Mr Hanchu ZHOU
(Supervisor: Dr Qingpeng ZHANG)
5. Intelligent Railway Foreign Object Detection: A Semi-supervised Deep Learning Method - by Miss Tiange WANG
(Supervisor: Dr Zijun ZHANG)
6. Generative Probabilistic Time Series Forecasting with Variation Recurrent Autoencoders - by Mr Zhong ZHENG
(Supervisor: Dr Zijun ZHANG)
7. Value-Gradient based Formulation of Optimal Control Problem and Machine Learning Algorithm - by Miss Jiayue HAN
(Supervisor: Dr Xiang ZHOU)
8. Signal Region Detection in Image Regression - by Mr Sanyou WU
(Supervisor: Dr Long FENG)
9. Data Intelligence for Better Fuel Planning and Fuel Efficiency at Airlines - by Miss Xinting ZHU
(Supervisor: Dr Lishuai LI)
10. Functional Analysis of Kinect Sensor Data to Assess Mobility of the Elderly - by Mr Cheng CAO
(Supervisor: Dr Xinyue LI)
11. Parametric Portfolio Policies in Online Marketplace Lending - by Mr Zonghao YANG
(Supervisor: Dr Xiao QIAO)
12. Integer Lattice Submodular Cover: Application to Viral Marketing - by Mr Qixin ZHANG
(Supervisor: Dr Yu YANG)
4:40 pm - 4:45 pm
Presented by Prof S. Joe QIN
Dean and Chair Professor, School of Data Science;
Director, Hong Kong Institute for Data Science, CityU
Site was made with Mobirise