More about HKUST
Making Data Communication Effective and Efficient in Computational Notebooks
PhD Thesis Proposal Defence
Title: "Making Data Communication Effective and Efficient in Computational
Notebooks"
by
Miss Yanna LIN
Abstract:
In the era of big data, data science has become pivotal in extracting and
generating insights from vast amounts of structured and unstructured data.
Computational notebooks have emerged as essential tools in this field,
integrating codes, outputs, and explanatory texts to create a computational
narrative that enhances the exploration and communication of complex data
insights. Despite their widespread adoption, significant challenges persist in
ensuring effective data communication within these notebooks, particularly
related to the quality of explanatory texts, the integration of texts, codes,
and outputs, and the diverse needs of stakeholders.
This thesis addresses these challenges through the development of innovative
solutions aimed at improving the efficiency and effectiveness of data
communication within computational notebooks. In the first work, we introduced
InkSight, a mixed-initiative plugin that automatically generates explanatory
texts for chart outputs based on users' intents expressed through sketches.
Recognizing that users still face difficulties in relating the explanatory
texts to the corresponding charts and codes, we designed a second plugin,
InterLink, to help clarify the relationships and cross-references between these
elements. Beyond facilitating continued data exploration, some stakeholders
prefer alternative formats of computational notebooks, such as data comics, to
gain high-level insights and avoid the clutter of interim notes and findings.
To address this, in the third work, we proposed DMiner, a data-driven framework
that automates the layout and interaction designs of selected visualizations
into data comics, meeting the diverse preferences of different audiences.
Finally, we discuss future research directions to further enhance the
efficiency and effectiveness of data communication in computational notebooks.
Date: Friday, 7 June 2024
Time: 3:00pm - 5:00pm
Venue: Room 3494
Lifts 25/26
Committee Members: Prof. Huamin Qu (Supervisor)
Prof. Cunsheng Ding (Chairperson)
Dr. Wei Zeng
Dr. Lionel Parreaux