
Modern data engineering teams spend a significant amount of time managing infrastructure clusters, jobs, notebooks, and environments. While graphical interfaces are useful for exploration and visibility, they are not always the most efficient option for repetitive or operational workflows, especially when teams are aiming for faster data operations using a CLI-first approach.
Yeedu CLI is a native command-line interface for the Yeedu data platform, designed to give engineers direct, scriptable access to everything they can do in the Yeedu UI straight from the terminal. It enables teams to run, monitor, and optimize Spark workloads entirely from the command line, without relying on slow, click-heavy workflows.
It is not a separate product. It is not an external tool. It is simply the fastest way to work with Yeedu and a practical example of how CLI improves operational efficiency in modern data platforms
Yeedu CLI is built on top of the same APIs, security model, and permissions used by the Yeedu UI. Every command you run through the CLI operates on your actual Yeedu workspaces, clusters, jobs, and notebooks.
Whether you prefer visual interaction or terminal-driven workflows, you are working with the same Yeedu platform, just through different interfaces. This parity ensures that automations created in the CLI behave exactly the same as actions performed in the UI, making it safe to operationalize critical workflows.
Data teams rarely operate on a single operating system. Production environments often run on Linux, data scientists prefer macOS, and analysts frequently work on Windows.
Yeedu CLI delivers a consistent experience across Windows, Linux, and macOS, allowing teams using Yeedu to share scripts, automation, and operational workflows without platform-specific adjustments.
Install once. Use everywhere. Operate Yeedu the same way across your entire team as an essential requirement for scalable, CLI-driven data operations.
Many routine operational tasks such as creating clusters, triggering jobs, monitoring execution, or retrieving logs require multiple steps when performed through a web interface. Each action involves navigation, page loads, and manual refreshes, which slows down day-to-day operations.
With Yeedu CLI, the same workflows are reduced to single, explicit commands that can be executed directly from the terminal. This is one of the most direct ways teams experience faster data operations using CLI-based tooling.
# Authenticate once
yeedu configure
# Create a cluster
yeedu cluster create --name "ml_cluster" --cluster_type STANDALONE --cluster_conf_id 2 --cloud_env_id 12 --spark_infra_version_id 11
# Start a Spark job
yeedu job start --job_id 123 --workspace_id 12
# Check job run status
yeedu job run-status --workspace_id 1 --run_id 1
# View notebook logs
yeedu notebook logs --workspace_id 1 --run_id 123 For teams wondering how to run a Spark job from the command line, Yeedu CLI removes the need for custom scripts or brittle integrations. Job execution, monitoring, and log access are all first-class CLI operations.
For teams running multiple jobs daily, these time savings quickly add up. Tasks that previously took several minutes through the UI can now be completed in seconds using the CLI.
Yeedu CLI also supports real-time job monitoring directly from the terminal. By using the --follow flag while starting a job, you can continuously track its execution status.
yeedu job start --job_id 1 --workspace_id 1 --follow When --follow is enabled, the CLI continuously fetches the job’s status until it reaches a terminal state:
This is especially useful during development, debugging, or production monitoring, where immediate feedback is critical. By reducing delays in detection and response, CLI-based monitoring directly contributes to better Spark performance optimization at runtime.
Instead of switching between the terminal and the UI, engineers can start a job and monitor its progress in a single, uninterrupted workflow.
At scale, this level of control and visibility significantly improves operational efficiency and reduces context switching across tools.
Installing Yeedu CLI takes a single command:
pip install yeedu-cli==4.10.4 Once installed, authentication can be configured in multiple secure ways.
YEEDU_RESTAPI_URL="https://your-yeedu-instance.com"
YEEDU_USERNAME=your_username
YEEDU_PASSWORD=your_password YEEDU_RESTAPI_TOKEN=eyJhbGciOiJIUzI1NiIsInR5cCI6IkpXVCJ9...
YEEDU_RESTAPI_TOKEN_FILE_PATH="/path/to/token/file" The CLI automatically manages sessions and token refresh, allowing you to authenticate once and continue working without interruption, a key requirement for secure, automated Spark job execution from the command line.
Yeedu CLI includes built-in tab completion to help users discover commands and options as they work.
# Partial command
$ yeedu clus[Tab]
$ yeedu cluster
# Partial argument
$ yeedu cluster create --na[Tab]
$ yeedu cluster create --name This CLI tab completion capability reduces the need to constantly reference documentation and minimizes errors when constructing complex commands especially useful during active development or incident response.
By speeding up command discovery and reducing syntax errors, auto completion plays a direct role in improving day-to-day operational efficiency.
Yeedu CLI provides comprehensive access to platform capabilities.
yeedu resource list-providers
yeedu resource get-provider-machine-type --cloud_provider_id 1 --machine_type_id 2 yeedu workspace search --workspace_name "analytics"
yeedu workspace list --output json yeedu cluster create --name "ml_cluster" --cluster_type STANDALONE --cluster_conf_id 2 --cloud_env_id 12 --spark_infra_version_id 11
yeedu cluster list-status --cluster_id 12 Create and manage Spark jobs, schedule executions using cron expressions, monitor runs in real time, and retrieve logs all without leaving the terminal. This end-to-end control is central to how CLI improves operational efficiency for Spark-based data platforms.
Yeedu CLI follows well-established command-line patterns, making it intuitive for engineers already familiar with modern CLIs.
Every command is designed around real Yeedu workflows managing Spark jobs, notebooks, and distributed compute environments.
Visual interfaces are essential for understanding and monitoring systems. However, when it comes to executing repeatable workflows at scale, Yeedu CLI gives engineers a more efficient, scriptable, and predictable way to work with Yeedu.
Yeedu CLI is the fastest way to interact with the Yeedu platform, and a practical foundation for Spark performance optimization through automation and repeatability.
Whether you are testing a single Spark job or orchestrating large-scale production workloads, Yeedu CLI enables you to operate Yeedu with precision, speed, and confidence directly from your terminal.
Complete documentation is available at docs.yeedu.io:
These resources are designed to help teams adopt Yeedu CLI quickly and safely.