query_processing.Rmd
This vignette shows the workflow for processing the .csv files returned by the online query. For each language, the files are read into a single citation data frame, which is then checked for duplicate rows (hits) and duplicate titles.
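languages <- c("english", "spanish", "portuguese")
base.dir <- "data/latin_america/corpus_csv"  # parent folder, one subfolder per language
for (language in languages) {
  csv.dir <- file.path(base.dir, language)  # folder holding this language's query results
  csv.files <- get_csv_files(csv.dir)  # .csv files found in that folder
  citation_dataframe <- get_citation_dataframe(csv.files)  # build the citation data frame
  citation_dataframe <- check_duplicate_row(citation_dataframe)  # check for duplicate rows (hits)
  citation_dataframe <- check_duplicate_title(citation_dataframe)  # check for duplicate titles
  write_citation_dataframe(csv.dir)
}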
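
The vignette does not show the bodies of `check_duplicate_row()` and `check_duplicate_title()`. As a rough illustration of what such checks might do, here is a minimal base R sketch. The helper names `drop_duplicate_rows()` and `drop_duplicate_titles()`, the `title` column name, and the toy data are all assumptions made for this example, not the package's actual implementation.

```r
# Hypothetical stand-ins for the duplicate checks; the real
# check_duplicate_row() and check_duplicate_title() may behave differently.

# Drop rows that are identical across every column.
drop_duplicate_rows <- function(df) {
  df[!duplicated(df), , drop = FALSE]
}

# Drop rows whose title repeats an earlier one, ignoring case and
# surrounding whitespace. Assumes a character column named "title".
drop_duplicate_titles <- function(df, title_col = "title") {
  key <- tolower(trimws(df[[title_col]]))
  df[!duplicated(key), , drop = FALSE]
}

# Toy data (made up): row 2 is an exact copy of row 1, and row 3
# repeats the same title with different case and a trailing space.
hits <- data.frame(
  title = c("Forest loss in the Amazon", "Forest loss in the Amazon",
            "forest loss in the amazon ", "Urban water use"),
  year = c(2010, 2010, 2011, 2015),
  stringsAsFactors = FALSE
)

unique_rows <- drop_duplicate_rows(hits)
unique_titles <- drop_duplicate_titles(unique_rows)
```

Running the row check first removes exact copies cheaply. The title check then catches records that differ only in other fields, which can happen when the same paper is returned by more than one query.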