Data Analytics Practice Opportunity 2022/23
Organiser: Research Support and Digital Initiatives, CUHK Library
Presentation Session (14 Feb 2023): Details and Registration
Introduction
The Data Analytics Practice Opportunity is organised by the Chinese University of Hong Kong Library. This event aims to:
- encourage reuse of data
- support the exploration of data and metadata from university and library collections
- encourage inter-disciplinary exploration
- provide students with opportunities to develop data analytics projects with real data
- promote data mining and visualization
Details
Participants will develop code to analyse data and tell stories with CUHK and CUHK Library resources. As far as possible, the code should be adaptable to other datasets in the future. When the project is completed, the code will be deployed on the Library’s Digital Scholarship Tools platform (http://dstools.lib.cuhk.edu.hk). A webpage and a poster about the project will be created. Participants will also present the project at a sharing session.
Students participating in the Practice Opportunity will develop:
- visualisation code
- project webpage
- project poster
- presentation of the project
Resources
- Data in the CUHK Digital Repository
- Metadata in the CUHK Library Archival Collections
- Data in the CUHK Research Data Repository
Sample data:
Name of collection/data | Sample research questions |
1. Biographical Relationship of Writers or Artists in Hong Kong | Stylometric analysis of Hong Kong writers/artists with reference to their interpersonal relationships |
2. 天文臺 (The Observatory Review) | Storytelling about Hong Kong between 1950 and 1985 using an intertextuality detection algorithm |
3. Cantonese Chanting in Hong Kong | Exploring the pitch-text relationship in poems using natural language processing |
4. 走馬樓三國吳簡.嘉禾吏民田家莂資料庫 | Reviewing economic development in the Wu state of the Three Kingdoms period using digital ethnography |
5. Buddhism Data (高峯和尚禪要 ; 攝大乘義章) | Exploring text mining and visualization of Buddhism data |
6. The Hongkong News | Tracing topics of public concern in Hong Kong from 1942 to 1945 using topic modeling with statistical analysis |
7. Chinese Medicine Texts Collection | Connection and disconnection of ingredients through the lens of a network diagram |
8. Metadata from the CUHK Library Archival Collections | Interaction between collections in the subject: Chinese literature – China – Hong Kong using network analysis |
9. The Chinese Student Weekly (中國學生周報) | N-gram analysis and named-entity recognition in The Chinese Student Weekly, and building a knowledge graph |
10. Millipede Genomes | Assembly of millipede genomes |
11. COVID-related data | Acceptance of vaccine and containment measures in Hong Kong during the COVID-19 period |
12. Mask-related data | Mask wearing as an intervention |
13. Medicine- and health-related data | Reviewing the health-related data deposited in the CUHK Research Data Repository |
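Several of the sample research questions above rely on standard text-analysis techniques such as n-gram counting. As a rough illustration only, the following Python sketch counts character n-grams across any collection of texts. It is not code from any past project; the function names and sample sentences are hypothetical, and a real project would load text from the Library's datasets instead.

```python
from collections import Counter
from typing import Iterable


def char_ngrams(text: str, n: int = 2) -> list[str]:
    """Return overlapping character n-grams, ignoring whitespace."""
    chars = [c for c in text if not c.isspace()]
    return ["".join(chars[i:i + n]) for i in range(len(chars) - n + 1)]


def count_ngrams(documents: Iterable[str], n: int = 2) -> Counter:
    """Count character n-grams across any iterable of text documents."""
    counts: Counter = Counter()
    for doc in documents:
        counts.update(char_ngrams(doc, n))
    return counts


if __name__ == "__main__":
    # Hypothetical sample sentences, not taken from any Library collection.
    sample_docs = ["香港學生周報", "學生周報在香港"]
    for gram, freq in count_ngrams(sample_docs, n=2).most_common(3):
        print(gram, freq)
```

Because `count_ngrams` accepts any iterable of strings, the same function could be pointed at a different dataset without modification, which is the kind of adaptability the project code is expected to aim for.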
Eligibility
All full-time undergraduate and postgraduate* students at CUHK
Number of members in each team: 1–3
Applicants will be invited to a selection interview
Number of successful teams: 3
* Postgraduate students under Postgraduate Studentship must seek approval from their own department upon receiving an offer.
Selection criteria
- Knowledge of data analytics
- Skills and experience in developing code
- Subject knowledge
- Ideas on reusing University and Library resources to develop a project
Allowance
Around $8,000 per team
Application process
Complete the application form at https://cloud.itsc.cuhk.edu.hk/webform/view.php?id=13654433 (Library Job Application Reference: SH20220830DI) and upload the following documents:
- CV
- Copy of transcripts
- 100-word statement of purpose with a proposed project topic (for the team*)
*For team applications, each team member must submit the online application form separately. Please state the name(s) of your teammate(s) under “supplementary information”.
Timeline
2–23 Sep 2022 | Application Period |
| Interview of candidates |
30 Sep 2022 | Announcement of successful applicants |
3 Oct 2022 | Briefing session |
Oct 2022–Feb 2023 | Development of projects |
Feb 2023 | Completion of projects, code, websites, and posters |
| Presentation of projects |
Sample project webpages
https://dsprojects.lib.cuhk.edu.hk/projects/#DA
Project Outcomes in Data Analytics Practice Opportunity 2022/23
Projects Completed:
- Sketch Your Ancient Arts: Deep Learning on Chinese Ink-Wash Paintings
- Text Analysis on Voice & Verse Poetry Magazine
- NLP and Visualization of The Observatory Review from Hong Kong Early Tabloid Newspaper
- Novel De Bruijn Graph Assembler for Millipede Genomes
Intellectual Property
In accordance with University policy, participants are required to assign the intellectual property of the project outcome to the University. Participants are authorised to use the original source files provided by the organiser only for the Data Analytics Practice Opportunity. They may not use the files for other purposes without the organiser's authorisation.
Participants must ensure that the code developed and the project output in the Data Analytics Practice Opportunity are their own original work or are used with legal authorisation from the owner of the intellectual property rights. If any third party raises allegations of intellectual property infringement or other legal irregularities, the participants shall assume all legal responsibility.
Enquiries
For any enquiries, please email the organizer at data@cuhk.edu.hk.