Skip to main content

Table 4 Comparisons of GPT hallucinations when producing codes

From: Towards automated phenotype definition extraction using large language models

Model

Average %

Minimum %

Maximum %

GPT 3.5

38

0

83

GPT 4

32

0

69