Data Processing Overview
Data processing steps form the foundation of Jetty workflows, enabling file operations, data transformations, storage integration, and external service connections. These steps handle everything from simple file reading to complex data analysis pipelines with trajectory-based storage management.
Category Structure
Tools Steps (6 steps)
File operations, utilities, and external integrations:
text_concatenate- Combines multiple text files into a single outputread_text_file- Reads text content from storage backenddownload_image- Downloads images from URLs with metadata extractionsave_text_file- Saves text content to storage backendadd_image_metadata- Extracts and adds EXIF metadata to imageswebhook_notify- Sends HTTP notifications to external services
Trajectory Storage Context
All file operations integrate with trajectory storage:
- Storage Providers: S3, GCS, local filesystem
- Automatic Path Resolution: Trajectory-specific paths
- Content Type Handling: Automatic MIME type detection
Path Patterns
- Relative Paths:
outputs/file.txt→ trajectory-scoped - Absolute Paths:
/shared/data/file.txt→ shared across workflows - Dynamic Paths:
outputs/{{step_name}}_{{timestamp}}.txt
Next Steps
- Tools Steps - File operations and utilities
- Pandas Steps - Data analysis and manipulation
- Storage Integration - Backend configuration guide
Getting Help
- Review individual step documentation for detailed parameters
- Check the Flow Library for data processing examples
- Consult trajectory documentation for path resolution patterns