Evaluating Text Extraction: Developing a Toolkit for ...