-
Notifications
You must be signed in to change notification settings - Fork 30
Supporting Airflow as executor #671
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Comments
Hi, I have not used Airflow so far but I think it could indeed be used as an alternative executor. It is going to be a bit tricky because the task graph generation is for now bound to dask and is implemented separately from the executors. But it is feasible. We did some evaluation of Airflow a while back and had the impression that it is more suited for larger, longer running tasks whereas dask works better for a large amount of tasks because it has less overhead. This may be outdated already, so please correct me if I'm wrong. Anyways, we came to the conclusion that sticking with dask was the better option for that time and didn't test Airflow. How many tasks (roughly) does your process spawn? As for implementation, a simple, first step would be to add an Cheers |
Hi, After thinking again about that, making an airflow executor might not be the right choice as stated above. I should keep airflow for orchestration and rely on dask for scheduling as it is so mush lightweight than airflow. Regards |
Hi,
I'm creating an issue here, but it's more like an a discussion.
I have developed a similar engine but it is far less powerful than this one, so I am looking how I can use it for my purpose.
My use case is building large scale geotiff from simulation grid. I am currently building COG on a web mercator tiles. I am using Apache Airflow as a task scheduler ( as I use it for other use cases).
I was wondering how can I use mapchete with an airflow excecutor. I guess that I have to extend the BaseExecutor, but as mapchete does the supervision, I don't think that it is the best way.
I think that I should generate a task list and then let Airflow run the dag with the task list.
Do you have an idea if there is a place where I can do that ?
Thanks
Regards
The text was updated successfully, but these errors were encountered: