PDFPageProxy_getTextContent
PartialEvaluator_getTextContent
Based on the discussion in issue 7445, there may be cases where an API consumer would want to get the text content as-is, without combined text items.
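To make the distinction concrete, here is a minimal, self-contained sketch of what "combined" versus "as-is" text items could look like. It is not the actual pdf.js implementation: the `collectTextItems` helper, the `{ str, fontName }` item shape, and the `disableCombineTextItems` option name are all assumptions made for illustration.

```javascript
// Hypothetical sketch, NOT the pdf.js implementation.
// Assumes each text chunk looks like { str, fontName }.
function collectTextItems(chunks, options = {}) {
  // Assumed option name; combining is on unless explicitly disabled.
  const combine = !options.disableCombineTextItems;
  const items = [];
  for (const chunk of chunks) {
    const last = items[items.length - 1];
    if (combine && last && last.fontName === chunk.fontName) {
      // Merge adjacent chunks that share a font into one text item.
      last.str += chunk.str;
    } else {
      // Otherwise keep the chunk as its own text item (copied, not aliased).
      items.push({ str: chunk.str, fontName: chunk.fontName });
    }
  }
  return items;
}

const chunks = [
  { str: "Hel", fontName: "f1" },
  { str: "lo", fontName: "f1" },
  { str: "World", fontName: "f2" },
];

// Default behaviour: adjacent same-font chunks are merged.
const combined = collectTextItems(chunks);
// Opt-out: every chunk is returned as its own item.
const raw = collectTextItems(chunks, { disableCombineTextItems: true });
```

With the default, a consumer sees fewer, longer items; with the opt-out, the original chunk boundaries are preserved, which is the case the issue discussion is concerned with.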
parseCMap
unittest