SmartExtractor class

Smart content extractor that uses multiple fallback strategies

Constructors

SmartExtractor.new()

Properties

hashCode int
The hash code for this object.
no setterinherited
runtimeType Type
A representation of the runtime type of the object.
no setterinherited

Methods

noSuchMethod(Invocation invocation) → dynamic
Invoked when a nonexistent method or property is accessed.
inherited
toString() String
A string representation of this object.
inherited

Operators

operator ==(Object other) bool
The equality operator.
inherited

Static Methods

extractAll(String html) SmartContent
Extract all common content types from HTML
extractAuthor(String html) String?
Extract author information
extractDescription(String html) String?
Extract meta description
extractEmails(String html) List<String>
Extract email addresses
extractImages(String html) List<String>
Extract all images with their URLs
Extract all links
extractMainContent(String html) String?
Extract main article content
extractOpenGraph(String html) OpenGraphData?
Extract Open Graph metadata
extractPhoneNumbers(String html) List<String>
Extract phone numbers
extractPrices(String html) List<String>
Extract prices (for e-commerce sites)
extractPublishDate(String html) String?
Extract publication date
extractTitle(String html) String?
Extract page title using multiple fallback strategies