Useful Data Science Quotes

Statistics is the mathematical language of scientific experimentation. It embodies the framing of key questions, design of experiments, analysis of data, and interpretation of results. As such, there are many quotations that have come along over the years. This is my brief attempt to compile some quotes that capture the flavor of the science of statistics.

  • Noubar Afeyan, Co-founder of Moderna and CEO of Flagship Pioneering
    • Exceptional achievements like the rapid development of the COVID-19 vaccine, he believes, aren’t usually unpredictable but rather the result of systematic processes that include embracing unreasonable propositions and even unreasonable people. Apple Podcast
  • Lee Artsimovich
    • Science is the best way to satisfy your own curiosity for the governmental account.
  • Charles Babbage
    • Errors using inadequate data are much less than those using no data at all.
  • John M. Barry
    • How does one know when one knows?… How does one know when to continue to push an experiment? And how does one know when to abandon a clue as a false trail? No one interested in truth will torture the data itself. But a scientist can–and should–seek any way to answer a question…. And if one experiment shows a hint of a result … a bump …, then a scientist designs the next experiment … to create conditions more likely to get more bumps until they become either consistent and meaningful or demonstrate that the initial bump was mere random variation without meaning…. One key to science is that work be reproducible. Someone in another laboratory doing the same experiment will get the same result. The result then is reliable enough that someone can build upon it. The most damning condemnation is to dismiss a finding as `not reproducible.’ That can call into question not only ability but on occasion ethics. If a reproducible finding comes from torturing nature, however, it is not useful. To be useful a result must … be … expandable. One must be able to enlarge it, explore it, learn from it, use it as a foundation to build structures upon. These things become easy to discern in hindsight. But how does one know when to persist, when to continue to try to make an experiment work, when to make adjustments–and when finally to abandon a line of thought as mistaken or incapable of solution with present techniques?… The question is one of judgment. For the distinguishing element of science is not intelligence but judgment. Or perhaps it is simply luck. (2004, pp. 262-263) The Great Influenza: The epic story of the deadliest plague in history. Viking Press.
  • Douglas M. Bates
    • Statistics is the science of variation. (~1985)
  • William Bateson
    • The allelic state at one locus can mask or uncover the effects of allelic variation at another. (1907)
  • Karl W. Broman
    • Expression genetic data may not require a new breed of scientist, but they probably will require a new breed of collaboration. The focus of the computational biologist will need to change from the development of tools that answer specific questions to the development of general tools that enable biologists to carry out their own investigations—to explore, visualize and find biological signals in complex data. (2005 Nat Genet 37: 209-210)
  • Rita Mae Brown
    • One out of every four people is suffering from some form of mental illness. Check three friends. If they’re OK, then it’s you.
  • George E.P. Box
    • All models are wrong, but some models are useful. (1979) Robustness in the strategy of scientific model building, in Robustness in Statistics, R.L. Launer and G.N. Wilkinson, Editors. 1979, Academic Press: New York.
    • Statisticians, like artists, have the bad habit of falling in love with their models.
  • Sir Winston Churchill
    • The utmost confusion is caused when people argue on different statistical data.
  • William C. Cochran
    • Where there is often a question for the doctor if it is ethical to conduct an experiment, but from the broader view-point [it is] a question of whether it is ethical not to conduct experiments.
  • Confucius
    • Real knowledge is to know the extent of one’s ignorance.
    • Life is really simple, but we insist on making it complicated.
  • William Cowper
    • Knowledge is proud that he has learned so much, Wisdom is humble that he knows no more. The Task, Book 6: The Winter Walk at Noon (1785)
  • Gertrude M Cox
    • The statistician who supposes that his main contribution to the planning of an experiment will involve statistical theory, finds repeatedly that he makes his most valuable contribution simply by persuading the investigator to explain why he wishes to do the experiment.
  • W. Edward Deming
    • Profit in business comes from repeat customers, customers that boast about your project or service, and that bring friends with them.
    • Innovation comes from the producer – not from the customer.
  • Paul Dirac
    • It is more important to have beauty in one’s equations that to have them fit experiment…. if one has really a sound insight, one is on a sure line of progress. If there is not complete agreement between the results of one’s work and experiment, one should not allow oneself to be too discouraged, because the discrepancy may well be due to minor features that are not properly taken into account and that will get cleared up with further development of the theory. [The evolution of the physicist’s picture of nature. Scientific American 208(5) (1963)]
    • The steady progress of physics requires for its theoretical formulation a mathematics which get continually more advanced … modern physical developments have required a mathematics that continually shifts its foundation and gets more abstract…. the advance in physics is to be associated with continual modification and generalisation of the axioms at the base of mathematics rather than with a logical development of any one mathematical scheme on a fixed foundation. [Paper on Magnetic Monopoles (1931)]
  • Benjamin Disraeli or Alfred Marshall or Mark Twain? (attributed)
    • There are three types of lies–lies, damn lies and statistics.
  • David Donoho (2000)
    • The coming century is surely the century of data.
  • Dogbert (Scott Adams)
    • But the shocking discovery was that 50% were below the median age.
  • The Economist (2010)
    • industrial revolution of data.
  • Bradley Efron
    • Those who ignore statistics are condemned to reinvent it. Statistics is the science of learning from experience. (2006)
  • Albert Einstein
    • God does not play dice with the universe.
    • Don’t do what I’ve done. Always have a cobbler’s job. Always have a job where you can get up in the morning, face yourself, that you’re doing something useful for humanity. Because nobody can be a genius every day. Don’t make that kind of mistake. You know, when I accepted a job at the University of Berlin, I had no duties really. Nothing to do except wake up and solve all the problems of the universe every morning. Nobody can do that. Don’t make that mistake. (as told to Ernest J. Sternglass)
    • Keep a cobbler’s job, so you can pursue wild ideas in your spare time. (paraphrased? by Gwyneth Jones in Life, p. 194, ISBN 0-9746559-2-9).
  • TS Eliot
    • Where is the Life we have lost in living? Where is the wisdom we have lost in knowledge? Where is the knowledge we have lost in information? The Rock (1934)
  • Evan Esar, prolific epigramologist
    • Statistics: the only science that enables different experts using the same figures to draw different conclusions.
  • Richard Feynman
    • You are under no obligation to remain the same person you were a year ago, a month ago, or even a day ago. You are here to create yourself, continuously.
  • Sir Ronald A. Fisher
    • We may at once admit that any inference from the particular to the general must be attended with some degree of uncertainty, but this is not the same as to admit that such inference cannot be absolutely rigorous, for the nature and degree of the uncertainty may itself be capable of rigorous expression. (1935) The Design of Experiments
    • Natural selection is a mechanism for generating an exceedingly high degree of improbability.
    • Biometry, the active pursuit of biological knowledge by quantitative methods. (1947) Innagural address to the International Biometric Society (Biometrics, sep 1948: 217-219).
  • Galileo Galilei
    • Nature is written in mathematical language.
  • Andrew Grove (2005)
    • The dominant cause for [the] discrepancy [between progress in the healthcare and microchip industries] appears to lie in the disparate rates of knowledge turns between the 2 industries…. Knowledge turns are indicators of the time it takes for an experiment to proceed from hypothesis to results and then lead to a new hypothesis and a new result. in JAMA, 2005 (in a talk by Josh Sommer, Chordoma Foundation, 2010, at the Sage Congress)
  • JBS Haldane as quoted in his obituary by Maynard Smith in Nature 206 (1965), p. 239.
    • An ounce of algebra is worth of a ton of verbal argument.
  • Hippocrates of Cos, On the Sacred Disease
    • For a theory is a composite memory of things apprehended with sense perception…. I approve of theorizing also if it lays its foundation in incident, and deduces its conclusion in accordance with phenomena.
  • Thomas H. Huxley (1860)
    • Sit down before a fact as a little child, be prepared to give up every preconceived notion. Follow humbly wherever and to whatever abysses nature leads, or you shall learn nothing.
  • E. T. Jaynes (1986)
    • Laplace [used] … Bayes’ theorem … to help him decide which astronomical problems to work on…. in which problems is the discrepancy between prediction and observation large enough to give a high probability that there is something new to be found? in Bayesian Methods: General Background
  • Michael Kattan (2002)
    • Imperfect prediction, despite being imperfect, can be valuable for decision-making purposes. in Statistical prediction models, artificial neural networks, and the sophism “I Am a Patient, Not a Statistic”. J Clin Oncol 20: 885-887. [Paper]
  • Lincoln Moses
    • Statistics is a body of methods for learning from experience.
  • Frederick Mosteller
    • Sound policies rest on good information.
  • Daniel H. Mowrey
    • The job of a statistician in an animal health world is challenging and interesting. In a real sense, you become part of the effort to find new ways of improving life in our civilization.
  • Jerzy Neyman (~1980)
    • Life is complicated, but not uninteresting.
  • Florence Nightengale (1820-1910)
    • Statistics … is the most important science in the whole world, for upon it depends the practical application of every other (science) and of every art.
    • To understand God’s thoughts we must study statistics, for these are the measure of His purpose.
  • John Paulos, Prof of Math
    • 79.48% of all statistics are made up on the spot.
  • Karl Pearson
    • Statistics is the grammar of science.
  • Plato
    • I know that I know nothing. The Apology
  • Lord Ernest Rutherford
    • If your experiment needs statistics, you ought to do a better experiment. (or: If you need to use statistics, then you should design a better experiment.)
  • Sir Michael Scholar, chair of the UK Statistics Authority
    • I think there’s been too much a tendency in the past to see the only important user [of statistics] as being the government. I think we have to get away from that.
  • Clifford Stoll (1995)
    • Our networks are awash in data. A little of it’s information. A smidgen of this shows up as knowledge. Combined with ideas, some of that is actually useful. Mix in experience, context, compassion, discipline, humor, tolerance, and humility, and perhaps knowledge becomes wisdom. in Silicon Snake Oil: second thoughts on the information highway, Anchor Books & Doubleday. ch. 11.
    • (Cliff Stoll uncovered the first know cyber attack, from E Germany in 1986; see https://en.m.wikipedia.org/wiki/Clifford_Stoll & 1990 NOVA program)
  • David Suzuki (2017)
    • The laboratory is an artifact. It does not reflect nature. (Paraphrased from 2007 talk. See his 1989 book, Inventing the future: Reflections on science, technology and nature. Stoddart Publ.)
  • John Wilder Tukey (1915-2000)
    • The best thing about being a statistician is that you get to play in everybody’s backyard.
    • The combination of some data and an aching desire for an answer does not ensure that a reasonable answer can be extracted from a given body of data.
    • Today, [software] is at least as important as the `hardware’ of tubes, transistors, wires, tapes and the like.
    • Far better an approximate answer to the right question, which is often vague, than the exact answer to the wrong question, which can always be made precise.
  • Simon Winchester
    • The basic idea within the thickets of DIKW is that knowledge now has tendrils easily distinguished, one from the other, and only when considered all together as an interlocking whole do the represent the complete spectrum of our relationship with Plato’s justified true belief…. The question of wisdom, however, and the concern for its future in a world where data, information, and knowledge are in massive, exponentially increasing, and ever accelerating oversupply… Knowing What We Know (2023, p 32-33)
    • … all these wondrous machines that, in one way or another, now allow us to avoid overworking our brains, will somehow diminish our capacity for thought … [or] instead clear the mind and allow it, now less clouded, taxed, and troubled, to seek out the potential it always had? To be thoughtful, considerate, patient–and wise…. And by doing so perhaps come to know not simply what we do know, but what we should know, in order to be fully human. Knowing What We Know (2023, p 625-626)
  • Brian S. Yandell (2007) as a comment to W. Whipple Neely.
    • Good statistical inference never strays very far from the data.
  • Sandy Zabell
    • Statistical analysis in cases involving small numbers can be particularly helpful because on many occasions intuition can be highly misleading. in Statistics, a Guide to the Unknown (third ed)

Written on July 15, 2022