This function provides the frequency of collocations in comments that correspond to the provided transcript, using fuzzy matching.
Arguments
- transcript_token
transcript token to act as baseline for notes, resulting from
token_transcript()
- note_token
tokenized document of notes, resulting from
token_comments()
- collocate_length
the length of the collocation. Default is 5
Examples
comment_example_rename <- dplyr::rename(comment_example, page_notes=Notes)
toks_comment <- token_comments(comment_example_rename)
transcript_example_rename <- dplyr::rename(transcript_example, text=Text)
toks_transcript <- token_transcript(transcript_example_rename)
collocation_object <- collocate_comments_fuzzy(toks_transcript, toks_comment)
#> Joining with `by = join_by(unlist.descript_ngrams.)`
#> Joining with `by = join_by(collocation)`
#> Joining with `by = join_by(collocation)`
#> Joining with `by = join_by(collocation.y)`
#> Joining with `by = join_by(collocation)`
#> Joining with `by = join_by(word_number)`