Skip to content

Issues: apify/crawlee

New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Author
Filter by author
Loading
Label
Filter by label
Loading
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Loading
Milestones
Filter by milestone
Loading
Assignee
Filter by who’s assigned
Sort

Issues list

Add a new dontCreateNewSessions option to SessionPool t-tooling Issues with this label are in the ownership of the tooling team.
#569 opened Feb 7, 2020 by pocesar
Idea: we could add function to extract schema.org microdata from a page feature Issues that represent new features or improvements to existing features. good first issue Good for newcomers. t-tooling Issues with this label are in the ownership of the tooling team.
#276 opened Jan 11, 2019 by jancurn
Explore parallelization for CrawlerCheerio discussion feature Issues that represent new features or improvements to existing features. t-tooling Issues with this label are in the ownership of the tooling team.
#282 opened Jan 16, 2019 by jancurn
RequestQueue.getRequest() should use local cache bug Something isn't working. feature Issues that represent new features or improvements to existing features. t-tooling Issues with this label are in the ownership of the tooling team.
#297 opened Feb 1, 2019 by jancurn
RequestQueue could have a limit on max enqueued requests t-tooling Issues with this label are in the ownership of the tooling team.
#321 opened Feb 21, 2019 by mnmkng
Add pendingRequestCount field to BasicCrawler t-tooling Issues with this label are in the ownership of the tooling team.
#414 opened Jul 2, 2019 by jancurn
utils.social phonesFromText issues with slashes t-tooling Issues with this label are in the ownership of the tooling team.
#437 opened Aug 1, 2019 by bwundo
Integrate adblocker functionality feature Issues that represent new features or improvements to existing features. t-tooling Issues with this label are in the ownership of the tooling team.
#456 opened Sep 18, 2019 by jakubbalada
Few not matched social handles feature Issues that represent new features or improvements to existing features. t-tooling Issues with this label are in the ownership of the tooling team.
#525 opened Dec 4, 2019 by metalwarrior665
Implement async iterator to KeyValueStore? feature Issues that represent new features or improvements to existing features. t-tooling Issues with this label are in the ownership of the tooling team.
#541 opened Jan 3, 2020 by pocesar
Persist state for key-value store/dataset during iteration t-tooling Issues with this label are in the ownership of the tooling team.
#543 opened Jan 6, 2020 by metalwarrior665
requestsFromUrl could support .xlsx feature Issues that represent new features or improvements to existing features. t-tooling Issues with this label are in the ownership of the tooling team.
#564 opened Jan 24, 2020 by metalwarrior665
Crawlers should have an option to respect robots.txt feature Issues that represent new features or improvements to existing features. t-tooling Issues with this label are in the ownership of the tooling team.
#229 opened Nov 13, 2018 by jakubbalada
[request] FileCrawler t-tooling Issues with this label are in the ownership of the tooling team.
#565 opened Feb 1, 2020 by pocesar
Consider Apify.utils.countKeywords() feature Issues that represent new features or improvements to existing features. t-tooling Issues with this label are in the ownership of the tooling team.
#577 opened Feb 12, 2020 by metalwarrior665
add filter predicate on session.setCookiesFromResponse t-tooling Issues with this label are in the ownership of the tooling team.
#616 opened Feb 29, 2020 by pocesar
Make crawlers EventEmitter (middleware behavior / change internals) t-tooling Issues with this label are in the ownership of the tooling team.
#635 opened Mar 28, 2020 by pocesar
Create specialized and contextual errors t-tooling Issues with this label are in the ownership of the tooling team.
#646 opened Apr 6, 2020 by pocesar
CheerioCrawler should support application/rss+xml content type by default feature Issues that represent new features or improvements to existing features. t-tooling Issues with this label are in the ownership of the tooling team.
#709 opened May 28, 2020 by metalwarrior665
Add a function to validate/encode names and keys feature Issues that represent new features or improvements to existing features. t-tooling Issues with this label are in the ownership of the tooling team.
#747 opened Jul 20, 2020 by metalwarrior665
Add persistStateKeyValueStoreId to openRequestList t-tooling Issues with this label are in the ownership of the tooling team.
#792 opened Oct 2, 2020 by pocesar
Improve Session Management guide. documentation Improvements or additions to documentation. t-tooling Issues with this label are in the ownership of the tooling team.
#796 opened Oct 13, 2020 by mnmkng
Improve HTTP status code handling Epic An epic is a large body of work that can be broken down into a number of smaller issues. t-tooling Issues with this label are in the ownership of the tooling team.
#812 opened Oct 20, 2020 by mnmkng
Adding to queue with options parameter feature Issues that represent new features or improvements to existing features. t-tooling Issues with this label are in the ownership of the tooling team.
#939 opened Feb 17, 2021 by maoryadin
Add ability to start with "no proxy" feature Issues that represent new features or improvements to existing features. t-tooling Issues with this label are in the ownership of the tooling team.
#2740 opened Nov 8, 2024 by strongpauly
ProTip! Follow long discussions with comments:>50.