identifies the subset of paper with validation data and align databases

align_humanReadingTopicModel(titleInd, validationHumanReading, topicDocs, DTM)

Arguments

titleInd

cross-walked indices between human-reading database and topic model

validationHumanReading

human-reading database

topicDocs

results from the topic model

DTM

document-term matrix derived from topic modelled corpus

Value

list with four elements: titleInd, validationHumanReading, validationTopicDocs, validationDTM