fetchFromProblematicSite method

pivox package
documentation
features/web_scraping/web_scraper_extension.dart
WebScraperExtension
fetchFromProblematicSite method

Future<String> fetchFromProblematicSite({

required String url,
Map<String, String>? headers,
int? timeout = 60000,
int? retries = 5,

})

Fetches HTML content from a problematic site using specialized techniques

This method uses multiple approaches to handle sites that are known to be difficult to scrape, such as those with anti-scraping measures

Implementation

Future<String> fetchFromProblematicSite({
  required String url,
  Map<String, String>? headers,
  int? timeout = 60000,
  int? retries = 5,
}) async {
  // Log the attempt
  logger.info('Attempting to fetch from problematic site: $url');

  // First try with the standard method
  try {
    return await fetchHtml(
      url: url,
      headers: headers,
      timeout: timeout,
      retries: retries,
    );
  } catch (e) {
    logger.warning('Standard fetch failed: $e');
    logger.info('Trying alternative approaches...');

    // Try with different approaches
    return _tryAlternativeApproaches(url, headers, timeout ?? 60000);
  }
}

Implementation

WebScraperExtension extension on WebScraper