A collection of recent papers on NLG evaluations, very applicable to components of LLM systems.
			
	
	- 
	
	
	Can Large Language Models Be an Alternative to Human Evaluations?Paper โข 2305.01937 โข Published โข 3
- 
	
	
	Decontextualization: Making Sentences Stand-AlonePaper โข 2102.05169 โข Published
- 
	
	
	RARR: Researching and Revising What Language Models Say, Using Language ModelsPaper โข 2210.08726 โข Published โข 1
- 
	
	
	SummEval: Re-evaluating Summarization EvaluationPaper โข 2007.12626 โข Published

 
								 
								




