Generative AI tools can be helpful for making data visualizations, but they often output confusing or misleading charts. Developers should know how to critically evaluate data visualizations output by Generative AI tools.
Developers can supercharge their coding abilities by leaning on Generative AI tools like Claude, ChatGPT, or other LLMS, but ultimately it is up to the developer to know whether a data visualization is accurate or not. Generative AI models can (and do) make mistakes and poor visual design choices. The best solution for developers is to equip themselves with the knowledge to spot these when they happen.
Data visualizations rely on effective, informed design choices to avoid being unintentionally misleading or confusing. Selecting the right chart type, including thoughtful annotations and title, and making appropriate use of color will all help to make charts that communicate clearly and accurately.
Data visualizations need appropriate axes to be truthful and legible. This means avoiding decontextualized breaks and setting the right number of axis ticks – neither too few (numbers are hard to interpret) nor too many (axes are cluttered).
Data visualizations need appropriate scaling to be truthful and legible. A linear scale (where numbers proceed by constant intervals) is almost always the best choice. Logarithmic scales (where numbers proceed exponentially) often cause confusion and should only be used with audiences who are very familiar with reading them.
Recall the example of Purdue pharmaceutical company using a misleading logarithmic scale to minimize the addiction risk of opioid painkillers.
In data visualizations, color associations pull on both helpful prior knowledge or harmful stereotypes. We tend to view darker colors as “more” and lighter colors as “less.” Color associations can also be culturally specific (for instance, red means “bad” or “stop” vs. red means “lucky” or “prosperous”) or influenced by the norms for a particular field (red means “negative financial balance”).
When creating data visualizations, it’s essential to choose the right color palettes to ensure truthfulness, legibility, and accessibility. This involves correctly implementing sequential, diverging, or categorical color palettes and ensuring that there is proper color contrast in your visualizations.
Titles, labels, and annotations are essential for clear and accessible data visualizations. They provide context, making it easier for viewers to understand the chart’s contents and purpose.
Misleading charts often arise from conscious or unconscious bias. Following sound design principles in data visualization reduces the potential for bias. Clear labeling and unbiased data representation are key to maintaining integrity. A well-designed chart not only informs but also builds trust with the audience.