{"id":237,"date":"2021-09-02T11:16:03","date_gmt":"2021-09-02T15:16:03","guid":{"rendered":"https:\/\/arriablog22.wpengine.com\/why-neural-language-models-dont-work-well-in-nlg\/"},"modified":"2022-03-31T16:37:24","modified_gmt":"2022-03-31T20:37:24","slug":"why-neural-language-models-dont-work-well-in-nlg","status":"publish","type":"post","link":"https:\/\/www.arria.com\/blog\/why-neural-language-models-dont-work-well-in-nlg\/","title":{"rendered":"Why neural language models don\u2019t work well in NLG"},"content":{"rendered":"<p>I am often asked about neural language models such as BERT and GPT-3, and whether Arria uses such models to generate text. I usually explain that while neural models have been very successful in NLP applications such as machine translation (MT), they are much less useful in data-to-text NLG (i.e., the kind of systems Arria builds). But why is this?<!--more--><\/p>\n<p>One reason is that hallucination (i.e., neural language models producing narratives that are factually wrong) is a much more serious problem in NLG than in MT. In NLG, hallucinations happen on a grand scale. We recently did a study of <a href=\"\/finding-creative-solutions-to-detect-mistakes-in-neural-nlg-narratives\" rel=\"noopener\" target=\"_blank\">mistakes in basketball stories produced by neural NLG systems<\/a>, and found that each story contained, on average, 20 factual errors (hallucinations). This is a huge number, and far beyond what is acceptable in Arria-type systems. Furthermore, many of these hallucinations were fairly subtle (e.g., \u201cscored 8 points\u201d instead of \u201cscored 12 points\u201d) and hence hard for readers to detect. 
The number of hallucinations would need to drop by at least a factor of 100 before Arria would consider using this technology in its systems.<\/p>\n<p>In machine translation, though, hallucination happens less frequently, and when it does occur, it is usually in edge cases or situations where there is a serious mismatch between what the system is asked to translate and what it was trained on. In other words, when an MT system is operating in its \u201ccomfort zone\u201d, hallucination is rare. Hallucination does occur when the MT system is pushed beyond its comfort zone, but even here most of the hallucinations result in texts that are obviously wrong to readers, because they contain nonsensical or repeated phrases. Hence, hallucination in MT is mostly a concern to MT vendors who sell systems to users in safety-critical applications (especially when the users don\u2019t understand what the system\u2019s comfort zone is).<\/p>\n<p>To give an analogy, an MT system based on a neural language model is like a human translator who does a good job when translating news articles (which are well written in everyday language), but makes mistakes when translating clinical notes from a doctor (which are very technical and often poorly written). Such a translator is certainly useful provided we understand her limitations. A data-to-text NLG system based on a neural language model, though, is like a journalist who takes the philosophy \u201cDon\u2019t let facts get in the way of a good story\u201d to an extreme, and writes stories that are plausible and well-written but have little connection to reality. 
This may be acceptable when generating fiction or fake news, but it is not acceptable when NLG texts are used to help people make important decisions!<\/p>\n","protected":false},"excerpt":{"rendered":"<p>I am often asked about neural language models such as&#8230;<\/p>\n","protected":false},"author":10,"featured_media":1351,"comment_status":"open","ping_status":"open","sticky":false,"template":"","format":"standard","meta":{"footnotes":""},"categories":[3],"tags":[13,14,29],"class_list":["post-237","post","type-post","status-publish","format-standard","has-post-thumbnail","hentry","category-blog","tag-natural-language-generation","tag-nlg","tag-nlg-blog"],"yoast_head":"<!-- This site is optimized with the Yoast SEO plugin v27.4 - https:\/\/yoast.com\/product\/yoast-seo-wordpress\/ -->\n<title>Why neural language models don\u2019t work well in NLG - NLG Blog<\/title>\n<meta name=\"robots\" content=\"index, follow, max-snippet:-1, max-image-preview:large, max-video-preview:-1\" \/>\n<link rel=\"canonical\" href=\"https:\/\/www.arria.com\/blog\/why-neural-language-models-dont-work-well-in-nlg\/\" \/>\n<meta property=\"og:locale\" content=\"en_US\" \/>\n<meta property=\"og:type\" content=\"article\" \/>\n<meta property=\"og:title\" content=\"Why neural language models don\u2019t work well in NLG - NLG Blog\" \/>\n<meta property=\"og:description\" content=\"I am often asked about neural language models such as...\" \/>\n<meta property=\"og:url\" content=\"https:\/\/www.arria.com\/blog\/why-neural-language-models-dont-work-well-in-nlg\/\" \/>\n<meta property=\"og:site_name\" content=\"NLG Blog\" \/>\n<meta property=\"article:published_time\" content=\"2021-09-02T15:16:03+00:00\" \/>\n<meta property=\"article:modified_time\" content=\"2022-03-31T20:37:24+00:00\" \/>\n<meta property=\"og:image\" content=\"https:\/\/www.arria.com\/blog\/wp-content\/uploads\/sites\/3\/2022\/03\/Blog-Image-31-Why-neural-language-models-dont-work-well-in-NLG-1024x431.png\" \/>\n\t<meta 
property=\"og:image:width\" content=\"1024\" \/>\n\t<meta property=\"og:image:height\" content=\"431\" \/>\n\t<meta property=\"og:image:type\" content=\"image\/png\" \/>\n<meta name=\"author\" content=\"Ehud Reiter\" \/>\n<meta name=\"twitter:card\" content=\"summary_large_image\" \/>\n<meta name=\"twitter:label1\" content=\"Written by\" \/>\n\t<meta name=\"twitter:data1\" content=\"Ehud Reiter\" \/>\n\t<meta name=\"twitter:label2\" content=\"Est. reading time\" \/>\n\t<meta name=\"twitter:data2\" content=\"2 minutes\" \/>\n<script type=\"application\/ld+json\" class=\"yoast-schema-graph\">{\"@context\":\"https:\\\/\\\/schema.org\",\"@graph\":[{\"@type\":\"Article\",\"@id\":\"https:\\\/\\\/www.arria.com\\\/blog\\\/why-neural-language-models-dont-work-well-in-nlg\\\/#article\",\"isPartOf\":{\"@id\":\"https:\\\/\\\/www.arria.com\\\/blog\\\/why-neural-language-models-dont-work-well-in-nlg\\\/\"},\"author\":{\"name\":\"Ehud Reiter\",\"@id\":\"https:\\\/\\\/www.arria.com\\\/blog\\\/#\\\/schema\\\/person\\\/31224a0ca3829f43a13781b3d7afd7e0\"},\"headline\":\"Why neural language models don\u2019t work well in NLG\",\"datePublished\":\"2021-09-02T15:16:03+00:00\",\"dateModified\":\"2022-03-31T20:37:24+00:00\",\"mainEntityOfPage\":{\"@id\":\"https:\\\/\\\/www.arria.com\\\/blog\\\/why-neural-language-models-dont-work-well-in-nlg\\\/\"},\"wordCount\":456,\"commentCount\":0,\"publisher\":{\"@id\":\"https:\\\/\\\/www.arria.com\\\/blog\\\/#organization\"},\"image\":{\"@id\":\"https:\\\/\\\/www.arria.com\\\/blog\\\/why-neural-language-models-dont-work-well-in-nlg\\\/#primaryimage\"},\"thumbnailUrl\":\"https:\\\/\\\/www.arria.com\\\/blog\\\/wp-content\\\/uploads\\\/sites\\\/3\\\/2022\\\/03\\\/Blog-Image-31-Why-neural-language-models-dont-work-well-in-NLG.png\",\"keywords\":[\"Natural Language Generation\",\"NLG\",\"NLG Blog\"],\"articleSection\":[\"NLG 
Blog\"],\"inLanguage\":\"en-US\",\"potentialAction\":[{\"@type\":\"CommentAction\",\"name\":\"Comment\",\"target\":[\"https:\\\/\\\/www.arria.com\\\/blog\\\/why-neural-language-models-dont-work-well-in-nlg\\\/#respond\"]}]},{\"@type\":\"WebPage\",\"@id\":\"https:\\\/\\\/www.arria.com\\\/blog\\\/why-neural-language-models-dont-work-well-in-nlg\\\/\",\"url\":\"https:\\\/\\\/www.arria.com\\\/blog\\\/why-neural-language-models-dont-work-well-in-nlg\\\/\",\"name\":\"Why neural language models don\u2019t work well in NLG - NLG Blog\",\"isPartOf\":{\"@id\":\"https:\\\/\\\/www.arria.com\\\/blog\\\/#website\"},\"primaryImageOfPage\":{\"@id\":\"https:\\\/\\\/www.arria.com\\\/blog\\\/why-neural-language-models-dont-work-well-in-nlg\\\/#primaryimage\"},\"image\":{\"@id\":\"https:\\\/\\\/www.arria.com\\\/blog\\\/why-neural-language-models-dont-work-well-in-nlg\\\/#primaryimage\"},\"thumbnailUrl\":\"https:\\\/\\\/www.arria.com\\\/blog\\\/wp-content\\\/uploads\\\/sites\\\/3\\\/2022\\\/03\\\/Blog-Image-31-Why-neural-language-models-dont-work-well-in-NLG.png\",\"datePublished\":\"2021-09-02T15:16:03+00:00\",\"dateModified\":\"2022-03-31T20:37:24+00:00\",\"breadcrumb\":{\"@id\":\"https:\\\/\\\/www.arria.com\\\/blog\\\/why-neural-language-models-dont-work-well-in-nlg\\\/#breadcrumb\"},\"inLanguage\":\"en-US\",\"potentialAction\":[{\"@type\":\"ReadAction\",\"target\":[\"https:\\\/\\\/www.arria.com\\\/blog\\\/why-neural-language-models-dont-work-well-in-nlg\\\/\"]}]},{\"@type\":\"ImageObject\",\"inLanguage\":\"en-US\",\"@id\":\"https:\\\/\\\/www.arria.com\\\/blog\\\/why-neural-language-models-dont-work-well-in-nlg\\\/#primaryimage\",\"url\":\"https:\\\/\\\/www.arria.com\\\/blog\\\/wp-content\\\/uploads\\\/sites\\\/3\\\/2022\\\/03\\\/Blog-Image-31-Why-neural-language-models-dont-work-well-in-NLG.png\",\"contentUrl\":\"https:\\\/\\\/www.arria.com\\\/blog\\\/wp-content\\\/uploads\\\/sites\\\/3\\\/2022\\\/03\\\/Blog-Image-31-Why-neural-language-models-dont-work-well-in-NLG.png\",\"width\":1
900,\"height\":800,\"caption\":\"Blog Image - 31 Why neural language models don\u2019t work well in NLG\"},{\"@type\":\"BreadcrumbList\",\"@id\":\"https:\\\/\\\/www.arria.com\\\/blog\\\/why-neural-language-models-dont-work-well-in-nlg\\\/#breadcrumb\",\"itemListElement\":[{\"@type\":\"ListItem\",\"position\":1,\"name\":\"Home\",\"item\":\"https:\\\/\\\/www.arria.com\\\/blog\\\/\"},{\"@type\":\"ListItem\",\"position\":2,\"name\":\"Why neural language models don\u2019t work well in NLG\"}]},{\"@type\":\"WebSite\",\"@id\":\"https:\\\/\\\/www.arria.com\\\/blog\\\/#website\",\"url\":\"https:\\\/\\\/www.arria.com\\\/blog\\\/\",\"name\":\"NLG Blog\",\"description\":\"\",\"publisher\":{\"@id\":\"https:\\\/\\\/www.arria.com\\\/blog\\\/#organization\"},\"potentialAction\":[{\"@type\":\"SearchAction\",\"target\":{\"@type\":\"EntryPoint\",\"urlTemplate\":\"https:\\\/\\\/www.arria.com\\\/blog\\\/?s={search_term_string}\"},\"query-input\":{\"@type\":\"PropertyValueSpecification\",\"valueRequired\":true,\"valueName\":\"search_term_string\"}}],\"inLanguage\":\"en-US\"},{\"@type\":\"Organization\",\"@id\":\"https:\\\/\\\/www.arria.com\\\/blog\\\/#organization\",\"name\":\"NLG Blog\",\"url\":\"https:\\\/\\\/www.arria.com\\\/blog\\\/\",\"logo\":{\"@type\":\"ImageObject\",\"inLanguage\":\"en-US\",\"@id\":\"https:\\\/\\\/www.arria.com\\\/blog\\\/#\\\/schema\\\/logo\\\/image\\\/\",\"url\":\"https:\\\/\\\/www.arria.com\\\/blog\\\/wp-content\\\/uploads\\\/sites\\\/3\\\/2022\\\/03\\\/arria_logo_125x30.png\",\"contentUrl\":\"https:\\\/\\\/www.arria.com\\\/blog\\\/wp-content\\\/uploads\\\/sites\\\/3\\\/2022\\\/03\\\/arria_logo_125x30.png\",\"width\":125,\"height\":30,\"caption\":\"NLG Blog\"},\"image\":{\"@id\":\"https:\\\/\\\/www.arria.com\\\/blog\\\/#\\\/schema\\\/logo\\\/image\\\/\"}},{\"@type\":\"Person\",\"@id\":\"https:\\\/\\\/www.arria.com\\\/blog\\\/#\\\/schema\\\/person\\\/31224a0ca3829f43a13781b3d7afd7e0\",\"name\":\"Ehud 
Reiter\",\"image\":{\"@type\":\"ImageObject\",\"inLanguage\":\"en-US\",\"@id\":\"https:\\\/\\\/secure.gravatar.com\\\/avatar\\\/b2b51dd76543bef69265c1b1b8d995a0132ea071f50988250c00fdea10b15bf9?s=96&d=mm&r=g\",\"url\":\"https:\\\/\\\/secure.gravatar.com\\\/avatar\\\/b2b51dd76543bef69265c1b1b8d995a0132ea071f50988250c00fdea10b15bf9?s=96&d=mm&r=g\",\"contentUrl\":\"https:\\\/\\\/secure.gravatar.com\\\/avatar\\\/b2b51dd76543bef69265c1b1b8d995a0132ea071f50988250c00fdea10b15bf9?s=96&d=mm&r=g\",\"caption\":\"Ehud Reiter\"},\"url\":\"https:\\\/\\\/www.arria.com\\\/blog\\\/author\\\/ehud-reiter\\\/\"}]}<\/script>\n<!-- \/ Yoast SEO plugin. -->","yoast_head_json":{"title":"Why neural language models don\u2019t work well in NLG - NLG Blog","robots":{"index":"index","follow":"follow","max-snippet":"max-snippet:-1","max-image-preview":"max-image-preview:large","max-video-preview":"max-video-preview:-1"},"canonical":"https:\/\/www.arria.com\/blog\/why-neural-language-models-dont-work-well-in-nlg\/","og_locale":"en_US","og_type":"article","og_title":"Why neural language models don\u2019t work well in NLG - NLG Blog","og_description":"I am often asked about neural language models such as...","og_url":"https:\/\/www.arria.com\/blog\/why-neural-language-models-dont-work-well-in-nlg\/","og_site_name":"NLG Blog","article_published_time":"2021-09-02T15:16:03+00:00","article_modified_time":"2022-03-31T20:37:24+00:00","og_image":[{"width":1024,"height":431,"url":"https:\/\/www.arria.com\/blog\/wp-content\/uploads\/sites\/3\/2022\/03\/Blog-Image-31-Why-neural-language-models-dont-work-well-in-NLG-1024x431.png","type":"image\/png"}],"author":"Ehud Reiter","twitter_card":"summary_large_image","twitter_misc":{"Written by":"Ehud Reiter","Est. 
reading time":"2 minutes"},"schema":{"@context":"https:\/\/schema.org","@graph":[{"@type":"Article","@id":"https:\/\/www.arria.com\/blog\/why-neural-language-models-dont-work-well-in-nlg\/#article","isPartOf":{"@id":"https:\/\/www.arria.com\/blog\/why-neural-language-models-dont-work-well-in-nlg\/"},"author":{"name":"Ehud Reiter","@id":"https:\/\/www.arria.com\/blog\/#\/schema\/person\/31224a0ca3829f43a13781b3d7afd7e0"},"headline":"Why neural language models don\u2019t work well in NLG","datePublished":"2021-09-02T15:16:03+00:00","dateModified":"2022-03-31T20:37:24+00:00","mainEntityOfPage":{"@id":"https:\/\/www.arria.com\/blog\/why-neural-language-models-dont-work-well-in-nlg\/"},"wordCount":456,"commentCount":0,"publisher":{"@id":"https:\/\/www.arria.com\/blog\/#organization"},"image":{"@id":"https:\/\/www.arria.com\/blog\/why-neural-language-models-dont-work-well-in-nlg\/#primaryimage"},"thumbnailUrl":"https:\/\/www.arria.com\/blog\/wp-content\/uploads\/sites\/3\/2022\/03\/Blog-Image-31-Why-neural-language-models-dont-work-well-in-NLG.png","keywords":["Natural Language Generation","NLG","NLG Blog"],"articleSection":["NLG Blog"],"inLanguage":"en-US","potentialAction":[{"@type":"CommentAction","name":"Comment","target":["https:\/\/www.arria.com\/blog\/why-neural-language-models-dont-work-well-in-nlg\/#respond"]}]},{"@type":"WebPage","@id":"https:\/\/www.arria.com\/blog\/why-neural-language-models-dont-work-well-in-nlg\/","url":"https:\/\/www.arria.com\/blog\/why-neural-language-models-dont-work-well-in-nlg\/","name":"Why neural language models don\u2019t work well in NLG - NLG 
Blog","isPartOf":{"@id":"https:\/\/www.arria.com\/blog\/#website"},"primaryImageOfPage":{"@id":"https:\/\/www.arria.com\/blog\/why-neural-language-models-dont-work-well-in-nlg\/#primaryimage"},"image":{"@id":"https:\/\/www.arria.com\/blog\/why-neural-language-models-dont-work-well-in-nlg\/#primaryimage"},"thumbnailUrl":"https:\/\/www.arria.com\/blog\/wp-content\/uploads\/sites\/3\/2022\/03\/Blog-Image-31-Why-neural-language-models-dont-work-well-in-NLG.png","datePublished":"2021-09-02T15:16:03+00:00","dateModified":"2022-03-31T20:37:24+00:00","breadcrumb":{"@id":"https:\/\/www.arria.com\/blog\/why-neural-language-models-dont-work-well-in-nlg\/#breadcrumb"},"inLanguage":"en-US","potentialAction":[{"@type":"ReadAction","target":["https:\/\/www.arria.com\/blog\/why-neural-language-models-dont-work-well-in-nlg\/"]}]},{"@type":"ImageObject","inLanguage":"en-US","@id":"https:\/\/www.arria.com\/blog\/why-neural-language-models-dont-work-well-in-nlg\/#primaryimage","url":"https:\/\/www.arria.com\/blog\/wp-content\/uploads\/sites\/3\/2022\/03\/Blog-Image-31-Why-neural-language-models-dont-work-well-in-NLG.png","contentUrl":"https:\/\/www.arria.com\/blog\/wp-content\/uploads\/sites\/3\/2022\/03\/Blog-Image-31-Why-neural-language-models-dont-work-well-in-NLG.png","width":1900,"height":800,"caption":"Blog Image - 31 Why neural language models don\u2019t work well in NLG"},{"@type":"BreadcrumbList","@id":"https:\/\/www.arria.com\/blog\/why-neural-language-models-dont-work-well-in-nlg\/#breadcrumb","itemListElement":[{"@type":"ListItem","position":1,"name":"Home","item":"https:\/\/www.arria.com\/blog\/"},{"@type":"ListItem","position":2,"name":"Why neural language models don\u2019t work well in NLG"}]},{"@type":"WebSite","@id":"https:\/\/www.arria.com\/blog\/#website","url":"https:\/\/www.arria.com\/blog\/","name":"NLG 
Blog","description":"","publisher":{"@id":"https:\/\/www.arria.com\/blog\/#organization"},"potentialAction":[{"@type":"SearchAction","target":{"@type":"EntryPoint","urlTemplate":"https:\/\/www.arria.com\/blog\/?s={search_term_string}"},"query-input":{"@type":"PropertyValueSpecification","valueRequired":true,"valueName":"search_term_string"}}],"inLanguage":"en-US"},{"@type":"Organization","@id":"https:\/\/www.arria.com\/blog\/#organization","name":"NLG Blog","url":"https:\/\/www.arria.com\/blog\/","logo":{"@type":"ImageObject","inLanguage":"en-US","@id":"https:\/\/www.arria.com\/blog\/#\/schema\/logo\/image\/","url":"https:\/\/www.arria.com\/blog\/wp-content\/uploads\/sites\/3\/2022\/03\/arria_logo_125x30.png","contentUrl":"https:\/\/www.arria.com\/blog\/wp-content\/uploads\/sites\/3\/2022\/03\/arria_logo_125x30.png","width":125,"height":30,"caption":"NLG Blog"},"image":{"@id":"https:\/\/www.arria.com\/blog\/#\/schema\/logo\/image\/"}},{"@type":"Person","@id":"https:\/\/www.arria.com\/blog\/#\/schema\/person\/31224a0ca3829f43a13781b3d7afd7e0","name":"Ehud Reiter","image":{"@type":"ImageObject","inLanguage":"en-US","@id":"https:\/\/secure.gravatar.com\/avatar\/b2b51dd76543bef69265c1b1b8d995a0132ea071f50988250c00fdea10b15bf9?s=96&d=mm&r=g","url":"https:\/\/secure.gravatar.com\/avatar\/b2b51dd76543bef69265c1b1b8d995a0132ea071f50988250c00fdea10b15bf9?s=96&d=mm&r=g","contentUrl":"https:\/\/secure.gravatar.com\/avatar\/b2b51dd76543bef69265c1b1b8d995a0132ea071f50988250c00fdea10b15bf9?s=96&d=mm&r=g","caption":"Ehud 
Reiter"},"url":"https:\/\/www.arria.com\/blog\/author\/ehud-reiter\/"}]}},"_links":{"self":[{"href":"https:\/\/www.arria.com\/blog\/wp-json\/wp\/v2\/posts\/237","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/www.arria.com\/blog\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/www.arria.com\/blog\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/www.arria.com\/blog\/wp-json\/wp\/v2\/users\/10"}],"replies":[{"embeddable":true,"href":"https:\/\/www.arria.com\/blog\/wp-json\/wp\/v2\/comments?post=237"}],"version-history":[{"count":0,"href":"https:\/\/www.arria.com\/blog\/wp-json\/wp\/v2\/posts\/237\/revisions"}],"wp:featuredmedia":[{"embeddable":true,"href":"https:\/\/www.arria.com\/blog\/wp-json\/wp\/v2\/media\/1351"}],"wp:attachment":[{"href":"https:\/\/www.arria.com\/blog\/wp-json\/wp\/v2\/media?parent=237"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/www.arria.com\/blog\/wp-json\/wp\/v2\/categories?post=237"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/www.arria.com\/blog\/wp-json\/wp\/v2\/tags?post=237"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}