Apify
dadosfera.services.apify.get_dataset_from_apify
get_dataset_from_apify(dataset_name)
Retrieve dataset items from Apify storage using the official client.
This function connects to Apify's storage service, retrieves or creates a dataset by name, and downloads all items from it. The function requires an Apify API key to be set in the environment variables.
PARAMETER | DESCRIPTION |
---|---|
dataset_name
|
Name of the dataset in Apify storage. Case-sensitive name of an existing dataset or name for a new one. Example: "my-scraped-data-2024" or "competitor-analysis"
TYPE:
|
RETURNS | DESCRIPTION |
---|---|
List[Dict[str, Any]]
|
List[Dict[str, Any]]: List of items from the dataset. Each item is a dictionary with structure depending on the dataset content. |
RAISES | DESCRIPTION |
---|---|
ApifyApiError
|
When API calls to Apify fail. Common cases: - Invalid API key - Rate limit exceeded - Network issues - Invalid dataset name format |
KeyError
|
When APIFY_API_KEY environment variable is not set. |
JSONDecodeError
|
When the dataset content is not valid JSON. |
UnicodeDecodeError
|
When the dataset content cannot be decoded as UTF-8. |
Example
Assuming APIFY_API_KEY is set in environment variables
dataset = get_dataset_from_apify("my-web-scraper-results") print(f"Retrieved {len(dataset)} items") for item in dataset: ... print(f"Found item: {item['title']}")
Notes
- Requires APIFY_API_KEY environment variable to be set
- Creates new dataset if name doesn't exist
- Downloads entire dataset into memory
- Uses UTF-8 encoding for dataset content
- Returns empty list if dataset is empty
Environment Variables
APIFY_API_KEY (str): The API key for Apify service.
See Also
- Apify API Documentation: https://docs.apify.com/api/v2
- Apify Client Python: https://docs.apify.com/api/client/python/
- Apify Dataset Documentation: https://docs.apify.com/platform/storage/dataset
Source code in dadosfera/services/apify.py
5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60 61 62 63 64 65 66 |
|