Making Data Communication Effective and Efficient in Computational Notebooks

PhD Thesis Proposal Defence


Title: "Making Data Communication Effective and Efficient in Computational 
Notebooks"

by

Miss Yanna LIN


Abstract:

In the era of big data, data science has become pivotal in extracting and 
generating insights from vast amounts of structured and unstructured data. 
Computational notebooks have emerged as essential tools in this field, 
integrating code, outputs, and explanatory text to create a computational 
narrative that enhances the exploration and communication of complex data 
insights. Despite their widespread adoption, significant challenges persist in 
ensuring effective data communication within these notebooks, particularly 
regarding the quality of explanatory text, the integration of text, code, 
and outputs, and the diverse needs of stakeholders.

This thesis addresses these challenges through the development of innovative 
solutions aimed at improving the efficiency and effectiveness of data 
communication within computational notebooks. In the first work, we introduced 
InkSight, a mixed-initiative plugin that automatically generates explanatory 
text for chart outputs based on users' intents expressed through sketches. 
Recognizing that users still have difficulty relating the explanatory 
text to the corresponding charts and code, we designed a second plugin, 
InterLink, to help clarify the relationships and cross-references between these 
elements. Beyond supporting continued data exploration, notebooks also need to 
serve stakeholders who prefer alternative formats, such as data comics, to 
gain high-level insights and avoid the clutter of interim notes and findings. 
To address this, in the third work, we proposed DMiner, a data-driven framework 
that automates the layout and interaction design needed to turn selected 
visualizations into data comics, meeting the diverse preferences of different 
audiences. 
Finally, we discuss future research directions to further enhance the 
efficiency and effectiveness of data communication in computational notebooks.


Date:                   Friday, 7 June 2024

Time:                   3:00pm - 5:00pm

Venue:                  Room 3494
                        Lifts 25/26

Committee Members:      Prof. Huamin Qu (Supervisor)
                        Prof. Cunsheng Ding (Chairperson)
                        Dr. Wei Zeng
                        Dr. Lionel Parreaux