Data Analytics Practice Opportunity 2024/25

Organizing Parties: CUHK Library and Data Science and Policy Studies, School of Governance and Policy Science, CUHK

Introduction

The Data Analytics Practice Opportunity is co-organised by the Chinese University of Hong Kong Library and Data Science and Policy Studies, School of Governance and Policy Science, CUHK. This event aims to:

  1. encourage reuse of data
  2. support the exploration of data and metadata from university and library collections
  3. encourage inter-disciplinary exploration
  4. provide students with opportunities to develop data analytics projects with real data
  5. promote data mining and visualization

Details

Participants will develop code to analyse data and tell stories using CUHK and CUHK Library resources. As far as possible, the code should be adaptable to other datasets in the future. When the project is completed, the code will be installed on a platform supported by the Library. A webpage and a poster about the project will be created, and participants will also present the project at a sharing session.

Students participating in the Practice Opportunity will develop:

  1. visualisation code
  2. project webpage
  3. project poster
  4. presentation of the project

Resources

  1. Data in the CUHK Digital Repository
  2. Metadata in the CUHK Library Archival Collections
  3. Data in the CUHK Research Data Repository

Suggested research topics:

Each team should select one topic for its proposed project: either one of the four research topics listed under (A) CUHK Research Data Repository, or a topic under (B) Digital Scholarship and CUHK Digital Repository.

Each numbered research topic is followed by its related collections/data (indented).

(A) CUHK Research Data Repository

  1. Dialogue System and Language Model
       1. Cantonese Customized Dialogue Dataset-KddRES
       2. StackOverflow Q&A Dataset
  2. Immigration Detention in Hong Kong
       1. Immigration Detention and Vulnerable Migrants in Hong Kong
  3. COVID-related analysis
       1. Rumors about COVID-19 on Weibo
       2. Stringent containment measures without complete city lockdown to achieve low incidence and mortality across two waves of COVID-19
       3. Acceptance of the COVID-19 vaccine based on the health belief model: A population-based survey
       4. Other COVID-related datasets deposited in the CUHK Research Data Repository
  4. Genomes
       1. Jellyfish Genomes
       2. Sea anemone Genomes
       3. Seagrass experimental data
       4. Hong Kong Biodiversity Genomics Hub
       5. Other genome-related datasets deposited in the CUHK Research Data Repository

(B) Digital Scholarship and CUHK Digital Repository

  1. Exploring vegetation cover characteristics from field work notebooks with digital scholarship methodologies
       1. Shiu-ying Hu Collection
  2. Any additional research topics using the data in the CUHK Digital Repository

Eligibility

All full-time undergraduate and postgraduate* students at CUHK
Number of members in each team: 1–3
Applicants will be invited to a selection interview
Number of successful teams: 3–4

* Postgraduate students under a Postgraduate Studentship have to seek approval from their own department when they are given an offer.

Selection criteria

  1. Knowledge of data analytics
  2. Skills and experience in developing code
  3. Subject knowledge
  4. Ideas on reusing University and Library resources to develop a project

Allowance

Around $8,000 per team

Application process

Complete the application form at https://cloud.itsc.cuhk.edu.hk/webform/view.php?id=13654433 (Library Job Application Reference: SH-20240910RDM) and upload the following documents:

  1. CV
  2. Copy of transcripts
  3. Statement of purpose of no more than 100 words, with a proposed project topic (for the team*)

*For team applications, each team member has to submit the online application form separately. Please state the name(s) of your teammate(s) under “Name(s) of Your Teammate”.

Timeline

12 Sep – 4 Oct 2024: Application period
18 Oct 2024: Interview of candidates
Fourth week of Oct 2024: Announcement of successful applicants
1 Nov 2024: Briefing session*
Nov 2024 – Mar 2025: Development of projects (including three progress sharing meetings*)
14 Mar 2025 (TBC): Presentation of projects*
End of Mar 2025: Completion of projects, code, websites, and posters

*Light refreshments will be provided.

Sample project webpages

https://dsprojects.lib.cuhk.edu.hk/projects/#DA

Intellectual Property

According to University policy, participants are required to assign the intellectual property of the project outcome to the University. Participants are authorised to use the original source files provided by the organiser only for the Data Analytics Practice Opportunity, and may not use them for other purposes without the organiser's authorisation.

Participants must ensure that the code developed and the project output in the Data Analytics Practice Opportunity are the participants' own original work or are legally authorised by the owner of the intellectual property rights. If any third party raises allegations of infringement of intellectual property rights or of other legal irregularities, the participants assume all legal responsibility.

Enquiries

For any enquiries, please email the organizer at data@cuhk.edu.hk.