Information Density

Why it matters

A chart is not a picture; it is a sentence made of ink. Every mark on it is either carrying meaning or getting in the way of the marks that do. The trouble is that the marks that get in the way — the drop shadows, the 3-D bevels, the rainbow gradients, the boxed legend eating a third of the canvas — are exactly the ones that look impressive, while the marks that carry the meaning are usually plain. Information density is the discipline of judging a graphic by how much it lets the reader understand per unit of ink and per unit of effort, and then cutting and re-encoding until that ratio is as high as the data will allow.

For example: a quarterly revenue dashboard arrives as a glossy 3-D pie chart with drop shadows, slices in a rainbow gradient, and a legend off to the side that you have to glance back and forth to decode. It looks like a finished, professional object. But ask what the reader is actually being made to do — compare the angles of tilted, shaded wedges and match each color to a word in a far-off key — and it falls apart. Redraw the same five numbers as a plain horizontal bar chart with the labels sitting next to the bars, and the comparison the reader came to make becomes effortless. Nothing was added. A great deal was removed. The second chart is denser with meaning precisely because it is sparser with ink.

What it reveals. Whether a graphic earns its ink — which marks carry data, which carry necessary scaffolding, and which are decoration that could be erased without losing a single fact the reader needs.
How it changes the read. You stop asking “does this chart look polished?” and start asking “what exact comparison is the reader meant to make, and is the encoding I chose the most accurate one available for that comparison?”
When to foreground it. A specific information graphic — a chart, dashboard, table, infographic, map, or typographic page — that has to support a particular reading or decision, and you want prescriptive, mark-by-mark critique rather than a vague “make it cleaner.”
What you’d miss without it. That impressiveness and clarity are usually in tension: the features that make a chart look sophisticated — depth, gloss, saturated color, ornament — are frequently the very features that degrade how accurately a reader can extract the numbers.
Where it misleads. Pushed to dogma, the urge to strip ink becomes minimalism for its own sake — erasing a gridline, a label, or a touch of redundancy that was genuinely helping the reader, and mistaking austerity for communication.

How it works

Start with the simplest possible version of the problem and the rest follows. Suppose you have five numbers — the revenue share of five product lines — and one job: let a reader see, at a glance, which line is biggest and by roughly how much. That job is the whole point of the graphic, and everything else is in service of it or in the way of it.

Edward Tufte gave us the first tool for telling the difference. Imagine taking a highlighter and marking only the ink on the page that is actually showing data — the height of a bar, the position of a dot, the number printed on a slice. Everything else is either structure you genuinely need (an axis, a single scale reference) or it is chartjunk: the drop shadow, the gradient fill, the 3-D extrusion, the heavy box around the legend, the gridline every thousand dollars. Tufte’s question is brutally simple: of all the ink on this page, how much of it is data-ink, and how much could you erase and lose nothing? A good graphic has a high ratio of data-ink to total ink. The discipline is to go mark by mark and ask of each one, “if I rub this out, does the reader lose a fact?” If not, it goes.

But erasing is only half the craft. The other half is choosing the right way to draw each number in the first place, and for that we turn to Jacques Bertin. Bertin noticed that we only have a small handful of ways to encode a value visually — he called them the visual variables, and they are the alphabet of every chart ever made: position (where a mark sits), size (how big it is), value (how light or dark), color (which hue), shape, orientation (which way it tilts), and texture. Any chart you can name is just data attributes mapped onto some of these variables. A bar chart maps quantity to length and category to position along an axis. A pie chart maps quantity to angle. A heatmap maps quantity to value. Pick the variable that fits the data, Bertin showed, and the chart reads itself; pick the wrong one — quantity onto raw color hue, say — and the reader has to work to decode what should have been obvious.

The third idea tells you which variable is the right one, and it is the part most people never learn. William Cleveland and Robert McGill ran the experiments, in 1984, that nobody had bothered to run: they showed people the same numbers encoded different ways and measured how accurately readers could actually judge the magnitudes. The result is a ranking. People read position along a common scale more accurately than almost anything else. Then length. Then angle and slope. Then area. Then volume and color shading, down at the bottom, where judgments are wildly imprecise. This single finding is why a bar chart beats a pie chart for the same data: a bar chart asks the reader to compare positions and lengths — near the top of the accuracy ranking — while a pie chart asks them to compare angles and areas, which sit much lower. The pie isn’t ugly; it’s just asking the eye to do a job the eye is bad at.

Put the three together and the whole craft of a dense, honest graphic comes into focus. Name the exact comparison the reader has to make. Use Cleveland and McGill to pick the visual variable that supports that comparison most accurately. Use Bertin to check that every other attribute is mapped to a fitting variable too. Then use Tufte to erase everything that is not carrying data, so the encoding you chose so carefully stands clear of the noise. Walk back to the 3-D rainbow pie with that toolkit and the verdict writes itself: the angle-and-area encoding is the least accurate option for a part-to-whole read, the depth and gloss are chartjunk that distort the very magnitudes they decorate, and the rainbow assigns hue — a variable Bertin reserves for categories, not amounts — to quantities. Strip it to a sorted bar chart with the numbers beside the bars, and you have packed more meaning into less ink, asking less of the reader, which is the entire game.

Framework & implementation

Output contract

The deliverable is a fixed set of sections, so the critique is auditable and implementable rather than a loose impression: a graphic summary and intended message (what the reader is meant to learn, and the exact comparison the graphic must support — the fixed target the rest of the audit is measured against); a data structure read (the attributes in play and whether each is categorical, ordered, or quantitative); the encoding choices per variable (each data attribute paired with the visual variable encoding it, with a fitness verdict against Bertin’s properties and the mismatch named where one exists); a data-ink audit (each mark classified as data-ink, structure-ink, or chartjunk, with the specific marks that could be removed without information loss listed); a perceptual-accuracy check (the elementary perceptual task the message requires, and the gap between what the chart asks the eye to do and what the eye does accurately, per Cleveland and McGill); and a redesign — prescriptive, ranked recommendations (which mark to remove, which encoding to substitute, which hierarchy to strengthen), each paired with the operation that surfaced it, its impact rank, and any residual tradeoff (brand or house-style, accessibility, data-honesty, audience expectation). Recommendations specific enough to implement without further interpretation are the verification threshold; a vague “simplify, declutter, improve hierarchy” is reshaped at the formatter step.

Origin and evidence

The three traditions the mode fuses were built decades apart and only later recognized as halves of one craft. Edward Tufte’s The Visual Display of Quantitative Information (1983) named the data-ink ratio and the idea of chartjunk, turning “good taste” in graphics into a measurable per-mark discipline. Jacques Bertin’s Semiology of Graphics (originally Sémiologie graphique, 1967) laid out the visual variables — the finite alphabet of visual encoding — and the properties that make each one fit or unfit for a given kind of data, founding the formal study of graphics as a language. William Cleveland and Robert McGill’s 1984 paper “Graphical Perception,” in the Journal of the American Statistical Association, supplied the experimental evidence that ranks those encodings by how accurately people actually read them — the finding that turns Bertin’s choices from preference into measurement and explains, in hard numbers, why position-based charts beat angle- and area-based ones. The lineage carries forward into modern data-visualization research and the grammar-of-graphics systems that now power most charting tools.

Applications and common uses

Dashboard and report critique. The native use: a business chart or KPI dashboard audited mark by mark and redrawn so the decision it supports can be read at a glance.
Journalism and explanatory graphics. Auditing the chart behind a story — checking that the encoding does not exaggerate or bury the magnitude the reader is meant to take away.
Scientific and technical figures. Catching the angle-for-length and area-for-position substitutions that quietly distort results in published figures and slides.
Maps and infographics. Testing whether a choropleth, a flow map, or an infographic encodes its quantities on variables the eye reads accurately, rather than on hue or 3-D height.
Slide and presentation design. Stripping the chartjunk that decoration-heavy templates add by default, so a single number or comparison lands.

Failure modes and when not to use it

Tufte-minimalism as dogma. Treating “erase ink” as an absolute strips gridlines, labels, or a touch of redundancy that was genuinely helping the reader. Minimalism is the default, not a commandment; the mode flags where modest structure or annotation actively serves the message.
Recommendations as gestures. “Simplify” and “improve hierarchy” are not findings — they are the absence of one. The mode’s discipline is mark-specific, encoding-specific, hierarchy-specific changes a designer can implement directly.
Auditing without a target. The verdicts all depend on the intended message and audience; run the audit without naming what the graphic must communicate and the perceptual-task and hierarchy checks have nothing to measure against. The mode degrades for rough sketches and surfaces a degraded-audit flag when the input fidelity is too low for mark-by-mark assessment.

When not to reach for it. When the input is a free composition — a painting, a poster, a page laid out for feel — and the question is the eye-path and the relative visual weight of its parts rather than encoding fitness, route to compositional-dynamics, the mode for reading visual force and balance without prescriptive encoding critique. When the operative work is negative space — the deliberately held-open void of a typographic page or a contemplative layout — route to ma-reading, which reads emptiness as an active element. And when the input is not an information graphic at all but an inhabited place whose character is the subject, that is a different territory entirely.

Compositional Dynamics — the depth-lighter sibling in the same territory: when the question is how the eye moves across a free composition and where its visual weight sits, not whether a chart’s encoding is accurate.
Ma Reading — the stance-counterpart in this territory: reading the deliberately held-open void — negative space as an active element — rather than the density of marks.
Tufte Data-Ink and Chartjunk — one of the three lenses this mode loads: classify every mark as data, structure, or junk, and erase the junk so the signal stands clear.
Bertin Visual Variables and Cleveland-McGill Perceptual Tasks — the other two lenses: pick the visual variable that fits each data attribute, and choose the encoding the eye reads most accurately (position over length over angle over area).