R/special-characters.R
get_special_character_info.Rd
Get Special Characters and Their Byte Codes
get_special_character_info(text, context_length = 7, bytes_per_char = 2)
text: character vector of length one
context_length: number of characters to the left and right of the special character to be put into column context
bytes_per_char: number of bytes per character
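To illustrate why a number of bytes per character matters, here is a minimal base R sketch. It assumes the text is encoded in UTF-8, where each of the German special characters ö, ü and ß takes two bytes. The variable names are illustrative and not part of the package.

```r
# Build the string from Unicode escapes so that the example does not depend
# on the encoding of the source file ("Schöne Grüße")
x <- enc2utf8("Sch\u00f6ne Gr\u00fc\u00dfe")

# Characters versus bytes: three characters need two bytes each
n_chars <- nchar(x, type = "chars")  # 12
n_bytes <- nchar(x, type = "bytes")  # 15

# Hexadecimal byte codes of a single special character (o with umlaut)
codes <- as.character(charToRaw(enc2utf8("\u00f6")))  # "c3" "b6"
```

In other encodings, such as Latin-1, the same characters occupy a single byte, so the byte counts above hold only under the UTF-8 assumption.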
data frame with columns special (special characters), bytes (hexadecimal byte codes as a space-separated string) and context (strings "around" the special characters)
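The shape of such a data frame can be sketched in base R. This is not the package's implementation: sketch_special_info is a hypothetical helper that treats every character needing more than one byte in UTF-8 as "special" and works per character, so it returns one row per character, whereas the function documented here may report individual bytes.

```r
# Hypothetical helper for illustration only (not part of kwb.fakin)
sketch_special_info <- function(text, context_length = 7) {
  stopifnot(is.character(text), length(text) == 1L)

  # Split the UTF-8 string into single characters
  chars <- strsplit(enc2utf8(text), "")[[1]]

  # Assumption: a character is "special" if it needs more than one byte
  positions <- which(nchar(chars, type = "bytes") > 1L)

  # Up to context_length characters on each side of the special character
  context <- vapply(positions, function(i) {
    left <- utils::tail(chars[seq_len(i - 1L)], context_length)
    right <- utils::head(chars[-seq_len(i)], context_length)
    paste0(
      paste(left, collapse = ""), "[", chars[i], "]",
      paste(right, collapse = "")
    )
  }, character(1))

  # Hexadecimal byte codes as one space-separated string per character
  bytes <- vapply(chars[positions], function(ch) {
    paste(as.character(charToRaw(ch)), collapse = " ")
  }, character(1), USE.NAMES = FALSE)

  data.frame(
    special = chars[positions],
    bytes = bytes,
    context = context,
    stringsAsFactors = FALSE
  )
}

result <- sketch_special_info("Sch\u00f6ne Gr\u00fc\u00dfe")
result$bytes  # "c3 b6" "c3 bc" "c3 9f"
```

Keeping the bytes as a single space-separated string per row keeps the data frame flat, at the cost of having to split the string again when individual byte values are needed.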
(text <- kwb.fakin:::example_string_with_specials("de"))
#> [1] "Schöne Grüße"
get_special_character_info(text)
#>   special bytes          context
#> 1    \xc3    c3 Sch [ ö ] ne Grü
#> 2    \xb6    b6  öne Gr [ ü ] ße
#> 3    \xc3    c3   ne Grü [ ß ] e
#> 4    \xbc    bc Sch [ ö ] ne Grü
#> 5    \xc3    c3  öne Gr [ ü ] ße
#> 6    \x9f    9f   ne Grü [ ß ] e