refextract: update test suite to reflect recent changes
- The test suite is now split into 'author', 'doi' ... sub-tests, testing each recognition feature separately. A variety of references have been added to each category.
- A 'mixed content' test has also been introduced, to test the recognition ability when combining different objects inside a reference.