A curated list of awesome datasources for Apache Spark built on top of the Python Data Source API.
- allisonwang-db/pyspark-data-sources - Custom PySpark data sources showcasing the Python Data Source API.
- alexott/cyber-spark-data-connectors - Cybersecurity-related custom data connectors for Spark. Blog post
- dmoore247/PythonDataSources - Python DataSource classes for Spark 4.x with healthcare and life sciences examples.
- dgomez04/pyspark-hubspot - Custom data source connector for reading from HubSpot's CRM objects.
- dgomez04/pyspark-faker - A lightweight toy project built to explore the new PySpark 4.0 custom data source API.