Data Processing Overview

Data processing steps form the foundation of Jetty workflows, enabling file operations, data transformations, storage integration, and external service connections. These steps handle everything from simple file reading to complex data analysis pipelines with trajectory-based storage management.

Category Structure

Tools Steps (6 steps)

File operations, utilities, and external integrations:

text_concatenate - Combines multiple text files into a single output
read_text_file - Reads text content from storage backend
download_image - Downloads images from URLs with metadata extraction
save_text_file - Saves text content to storage backend
add_image_metadata - Extracts and adds EXIF metadata to images
webhook_notify - Sends HTTP notifications to external services

Trajectory Storage Context

All file operations integrate with trajectory storage:

Storage Providers: S3, GCS, local filesystem
Automatic Path Resolution: Trajectory-specific paths
Content Type Handling: Automatic MIME type detection

Path Patterns

Relative Paths: outputs/file.txt → trajectory-scoped
Absolute Paths: /shared/data/file.txt → shared across workflows
Dynamic Paths: outputs/{{step_name}}_{{timestamp}}.txt

Next Steps

Tools Steps - File operations and utilities
Pandas Steps - Data analysis and manipulation
Storage Integration - Backend configuration guide

Getting Help

Review individual step documentation for detailed parameters
Check the Flow Library for data processing examples
Consult trajectory documentation for path resolution patterns

Category Structure​

Tools Steps (6 steps)​

Trajectory Storage Context​

Path Patterns​

Next Steps​

Getting Help​