9 Comments
Onyekachukwu Okonji:

Really enjoyed reading this. I'm currently doing research on hallucinations, and I'm glad I came across this post.

Victor Dibia, PhD:

Glad you found it useful. Thanks for sharing your feedback!

Beyond AI:

Great work. Very in-depth coverage.

Victor Dibia, PhD:

Thanks @derrick! Glad you found it useful, and thanks for sharing and subscribing. Your support is much appreciated!

Beyond AI:

Absolutely! Don't mention it.

Doug Ross:

Great summary, thank you.

Wondering if there are visualizations/UIs that can help capture model performance in various interim states?

Or provenance/bias of training datasets?

Victor Dibia, PhD:

Model performance benchmarking is an interesting emergent area.

It is also hard because performance is affected by factors beyond the model's weights (e.g., prompt design).

Some benchmarks (see the sketch after this list for running the Eleuther harness):

- HELM: https://crfm.stanford.edu/helm/latest/

- Eleuther AI LM Evaluation Harness: https://github.com/EleutherAI/lm-evaluation-harness
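
As a concrete example, here is a minimal sketch of scoring a small HuggingFace model on a single task with the Eleuther harness. It assumes the harness's Python API (lm_eval.simple_evaluate) from the v0.4+ releases; argument names can differ across versions, and the model and task shown are purely illustrative.

```python
# Minimal sketch: evaluate a small HF model on one benchmark task with
# EleutherAI's lm-evaluation-harness (pip install lm-eval).
# Assumes the v0.4+ API; exact argument names may vary by release.
import lm_eval

results = lm_eval.simple_evaluate(
    model="hf",                                      # HuggingFace backend
    model_args="pretrained=EleutherAI/pythia-160m",  # illustrative model choice
    tasks=["hellaswag"],                             # one task from the suite
    num_fewshot=0,                                   # zero-shot setting
    batch_size=8,
)

# Per-task metrics (e.g., accuracy) are keyed by task name.
print(results["results"]["hellaswag"])
```

Note that even with a fixed harness, scores shift with prompt format and few-shot count, which is part of why benchmarking is hard.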

UIs for benchmarking are typically task-based.

Some tools for comparing model outputs:

- Scale Spellbook: https://scale.com/spellbook

- https://twitter.com/natfriedman/status/1633582489850773504?lang=en

Jon:

Great work! Curious, how do you generate your post images? They look great.

Victor Dibia, PhD:

Thank you!

Designed by hand in Figma.
