[Phase 3] Apache DolphinScheduler Weekly FAQ Collection

April 15-April 21

Q: Is there a mirror image of openEuler + DolphinScheduler?
Answer: No, the official images released by DolphinScheduler are all based on Debian.

Q: Are there any examples of DolphinScheduler resource management?
Answer: You can refer to the Apache DolphinScheduler usage specification and usage tips sharing , or look at the official documentation, https://dolphinscheduler.apache.org/zh-cn/docs/3.1.5/guide/resource/file-manage

Q: I can't connect to the test connection when creating a dolphin scheduling data source, but I can connect to it through telnet on the server. What's going on?
Answer: Please make sure that the username, password and optional configuration you fill in are correct, such as whether the MySQL ssl is enabled or not.

Q: Local file (file:///) is used for management. Is there a size limit for uploaded files?
Answer: There is no size limit, but too large may cause timeout when uploading.

Q: There are already users, why do we need the concept of tenants? What is the meaning of tenants corresponding to Linux users? If sudo.enable=false is set, is it equivalent to not being able to use multi-tenancy
? ds web operating system, tenant -> user of task or resource operation, for resource isolation.

Q: Does DolphinScheduler 3.1.5 support Trino sql task type configuration?
Answer: 3.1.5 does not support Trino, and Trino support is pending.

Q: There is a 13-hour difference between the end time displayed on the workflow instance page and the end_time in the database. What's the matter with the time inconsistency?
Answer: The time zone of MySQL and Sprintboot needs to be consistent.

Q: DolphinScheduler scheduling can only use python2.7 environment? Can the new version of python3 be called directly?
Answer: You can use Python3, but you need to modify the datax.py script.

Q: The streams are all SQL. After the job stream is online, how can the backend directly call the job stream? In DolphinScheduler, I really don’t want to see the call. Is the API of DolphinScheduler used for scheduling in the framework?
Answer: If the workflows are all SQL tasks, triggering workflows can be called through pages or API interfaces.

Q: How many tasks can DolphinScheduler support to submit? What is the maximum concurrency of a single node?
Answer: You can refer to: The latest performance test | Apache DolphinScheduler schedules tasks concurrency per minute is 2 times that of Apache Airflow

Q: What are the advantages and specific solutions of replacing Airflow with DolphinScheduler? Answer: DolphinScheduler is developed locally, and its operation
is more in line with the habits of Chinese people. It also has advantages in performance and functions. You can refer to the performance comparison article to view: The latest performance test | To Apache DolphinScheduler, you praise the evolution of the scheduling system of the big data development platform Migration tool: Migration tool Air2phin announced open source, 2-step migration of Airflow to Dolphinscheduler

Q: How to transfer data between two Python nodes? Now I can only open nas to share and write files, which feels very inconvenient.
Answer: At present, we can only do this. 3.2.0 will have a better method https://github.com/apache/dolphins cheduler/pull/12552

Question: DolphinScheduler 2.x has too many task records of more than 1 million on MySQL and the host, what is the automatic cleaning method?
Answer: It is recommended to write corntab to clear the log.

This article is published by OpenWrite, a multi-post platform for blogging !

Guess you like

Origin blog.csdn.net/DolphinScheduler/article/details/130390091