摘要: A maximum entropy classifier can be used to extract sentences from documents. Experiments using technical documents show that such a tends treat features in categorical manner. This results performance is worse than when extracting naive Bayes classifier. Addition of an optimised prior the improves over and above (even also extended with similar prior). Further experiments that, should we have at our disposal extremely informative features, then able yield excellent results. Naive Bayes, contrast, cannot exploit these so fundamentally limits sentence extraction performance.