2023-11-26
\[\begin{gather*} Input\xrightarrow[\text{}]{\text{Process}}Deliverables \end{gather*}\]
\[\begin{gather*} Problem\xrightarrow[\text{}]{\text{Thinking}}Solution \end{gather*}\]
\[\begin{gather*} Design\xrightarrow[\text{}]{\text{Programming}}Code \end{gather*}\]
Input
Output
\[\begin{gather*} Input\xrightarrow[\text{}]{\text{Process 1}}\text{Deliverable 1} \end{gather*}\]
\[\begin{gather*} Input\xrightarrow[\text{}]{\text{Process 2}}\text{Deliverable 2} \end{gather*}\]
\[\begin{gather*} \tiny{\text{Method 1 (quality 1, cost 1, ...)}}\\ \text{Input }\to\boxed{\xrightarrow[\text{}]{\text{Process 1}}\text{Deliverable 1}} \end{gather*}\]
\[\begin{gather*} \tiny{\text{Method 2 (quality 2, cost 2, ...)}}\\ \text{Input }\to\boxed{\xrightarrow[\text{}]{\text{Process 2}}\text{Deliverable 2}} \end{gather*}\]
Input | Process | Deliverable |
---|---|---|
Names of capitals | Information gathering | Table |
What is an LLM? | Understanding | Document |
A to-do list app | Software development | Code |
Process / Method |
Cost | Deliverable | Correctness |
---|---|---|---|
Remembering | Minimal | Table Incomplete |
Probably not |
Gathering data from Wikipedia |
Affordable | Table Complete |
Yes |
Process | Method | Cost | Deliverable | Correctness |
---|---|---|---|---|
Reading | Taking notes | Minimal | Document Plain English |
Subjective understanding Not verifiable |
Reading | The scientific method 1 | Affordable | Document Structured |
Objective understanding Verifiable by peers |
Reading | Formal method 2 3 | Expensive | Math model | Theorem provers Fully verifiable |
Process: MVP-style 1 software development 2
Step 3 | Method | Cost | Deliverables | Correctness |
---|---|---|---|---|
Specification | Taking notes | Minimal | Document Plain English |
Subjective understanding Not verifiable |
Implementation | Just write code | Minimal | Code Imperative |
Verifiable by tests |
Verification | Write some tests | Affordable | Test suite | By 100% code coverage |
Process: Production-ready software development
Step | Method | Cost | Deliverables | Correctness |
---|---|---|---|---|
Specification | Design in practice 1 | Affordable | Document Structured |
Objective understanding Verifiable by peers |
Implementation |
Finite-state machine 2 Pattern matching 3 |
Affordable | Code Declarative4 |
Built-in correctness mechanisms The rest verifiable by tests |
Verification | Property-based testing 5 | Affordable | Test suite | 100% code coverage |
Process: Mission critical 1 software development
Step 2 | Method | Cost | Deliverables | Correctness |
---|---|---|---|---|
Specification | Formal specification 3 Theorem building |
Expensive | Math model | Yes |
Verification | Formal proof Theorem proover |
Expensive | Math model | Yes |
Implementation | Obtained 4 | Affordable | Code | Yes |
Methods | 16 |
Remembering Gathering data Taking notes … |
Deliverables | 8 |
Table Document Code … |
Correctness | 9 |
Probably not Subjective understanding Not verifiable … |
Country | Capital |
---|---|
Abkhazia | Sukhumi |
Afghanistan | Kabul |
… | … |
Zimbabwe | Harare |
Table
"""Migrate a Sim to a Sim2"""
def migrate_sim(sim::Sim):
init_sys = System2(sim.cell, sim.atoms)
final_sys = unknown(init_sys.cell.pbc)
return Sim2(sim.calc, init_sys, final_sys)
for old_sim in get_sims(old_db_cxn):
insert_sim(migrate_sim(old_sim), new_db_cxn)
new_query = "SELECT S1.pth, S2.pth FROM ..."
Code
Syntax | Formalism | Notes |
---|---|---|
Table | Strong | Row, column, cell |
Document Plain english |
None | Free form Typographic structure only |
Document Structured |
Some | Strict form Follows the scientific method |
Code Test suite Math model |
Strong | Precise, formal language |
Syntax | Formalism | Correctness |
---|---|---|
Table | Strong | Yes |
Document Plain english |
None | Subjective understanding Not verifiable |
Document Structured |
Some | Objective understanding Verifiable by peers |
Code | Strong |
Yes Built-in correctness mechanisms The rest verifiable by tests |
Test suite | Strong | Yes 100% code coverage |
Math model | Strong |
Fully verifiable Theorem provers |
Country | Capital |
---|---|
Abkhazia | Washington, DC |
Afghanistan | Kabul |
… | … |
Zimbabwe | Harare |
Deliverable type | Defintion of correctness |
---|---|
Table | Rows, columns, cells must be correct |
Structured document for understaning |
Must follow the Scientific method 1 Observation -> Question -> Hypothesis -> … |
Formal specification 2 | Logically consistent Accurate Complete |
Code 3 |
Functional correctness Partial correctness Total correctness |
Formal methods already connect deliverable types and levels of correctness
Method | Deliverable | Correctness |
---|---|---|
Informal | Free form, natural language | Ambiguity and lack of organization can lead to incompleteness, inconsistency, and misunderstandings |
Formatted (Semi formal) |
Standardized syntax |
Basic consistency and completeness checks Imprecise semantics implies other sources of error may still be present |
Formal |
Syntax and semantics rigorously defined Precise form, perhaps mathematical |
Eliminate imprecision and ambiguity Provide basis for mathematically verifying equivalence between specification and implementation |
As an example, suppose we have the hypothesis that all apples are red, and we test this hypothesis by examining a group of ten apples and seeing what color they are. If all ten apples are red, our hypothesis is supported, but it’s not proven: if we looked at more apples, some of them might turn out to be green. 1
\[\begin{gather*} \tiny{\text{Provably correct, it's a proof}}\\ \text{Input }\to\boxed{\xrightarrow[\text{}]{\text{Process 1}}\text{Works 100%}} \end{gather*}\]
\[\begin{gather*} \tiny{\text{Likely correct, fails on edge cases}}\\ \text{Input }\to\boxed{\xrightarrow[\text{}]{\text{Process 2}}\text{Works 100% - edges}} \end{gather*}\]
\[\begin{gather*} \tiny{\text{Formal method, Provably correct}}\\ \text{Input }\to\boxed{\xrightarrow[\text{Semantics ✅}]{\text{Syntax ✅}}\text{Works 100%}} \end{gather*}\]
\[\begin{gather*} \tiny{\text{Semi-formal method, Likely correct}}\\ \text{Input }\to\boxed{\xrightarrow[\text{Semantics ❓}]{\text{Syntax ✅}}\text{Works 100% - edges}} \end{gather*}\]
Theorem provers may be used to undertake fully formal machine-checked proofs. Despite improving tools and declining costs, this can be very expensive and is only practically worthwhile if the cost of mistakes is very high (e.g., in critical parts of operating system or microprocessor design).
Method | Correctness | Cost |
---|---|---|
Formal | Provably correct | Expensive |
Semi-formal | Likely correct | Affordable |
Informal | Maybe correct | Minimal, but it’s a debt |