Apache Hop

User manual 2.9.0
Getting Started

This guide walks you through the Hop basics.

We’ll focus on the core knowledge you need to find your way around Hop, without going into detail. At the end of this guide, you’ll find links for diving deeper into various Hop topics.

This is what we’ll cover:

  • What is Hop?
  • Concepts
  • Download & Install
  • Tools
  • Hop Gui overview
  • Pipelines
  • Workflows
  • Next Steps


© 2019-2025 The Hop Team.
All marks mentioned may be trademarks or registered trademarks of their respective owners.