An article is extracted from a document using a decision combiner to process a plurality of reading order alternatives. The text flow analysis generates the plurality of reading order alternatives of separate body text regions....http://www.google.com.hk/patents/US7756871?utm_source=gb-gplus-share專利 US7756871 - Article extraction