Mocking is an essential part of unit testing, and the Mockito library makes it easy to write clean and intuitive unit tests for your Java code.
Get started with mocking and improve your application tests using our Mockito guide:
Handling concurrency in an application can be a tricky process with many potential pitfalls. A solid grasp of the fundamentals will go a long way to help minimize these issues.
Get started with understanding multi-threaded applications with our Java Concurrency guide:
Spring 5 added support for reactive programming with the Spring WebFlux module, which has been improved upon ever since. Get started with the Reactor project basics and reactive programming in Spring Boot:
Since its introduction in Java 8, the Stream API has become a staple of Java development. The basic operations like iterating, filtering, mapping sequences of elements are deceptively simple to use.
But these can also be overused and fall into some common pitfalls.
To get a better understanding on how Streams work and how to combine them with other language features, check out our guide to Java Streams:
Get started with Spring and Spring Boot, through the Learn Spring course:
>> LEARN SPRINGExplore Spring Boot 3 and Spring 6 in-depth through building a full REST API with the framework:
Yes, Spring Security can be complex, from the more advanced functionality within the Core to the deep OAuth support in the framework.
I built the security material as two full courses - Core and OAuth, to get practical with these more complex scenarios. We explore when and how to use each feature and code through it on the backing project.
You can explore the course here:
Spring Data JPA is a great way to handle the complexity of JPA with the powerful simplicity of Spring Boot.
Get started with Spring Data JPA through the guided reference course:
Refactor Java code safely β and automatically β with OpenRewrite.
Refactoring big codebases by hand is slow, risky, and easy to put off. Thatβs where OpenRewrite comes in. The open-source framework for large-scale, automated code transformations helps teams modernize safely and consistently.
Each month, the creators and maintainers of OpenRewrite at Moderne run live, hands-on training sessions β one for newcomers and one for experienced users. Youβll see how recipes work, how to apply them across projects, and how to modernize code with confidence.
Join the next session, bring your questions, and learn how to automate the kind of work that usually eats your sprint time.
1. Overview
In the first article of the series, we introduced Spring Cloud Data Flowβs architectural component and how to use it to create a streaming data pipeline.
As opposed to a stream pipeline, where an unbounded amount of data is processed, a batch process makes it easy to create short-lived services where tasks are executed on demand.
2. Local Data Flow Server and Shell
The Local Data Flow Server is a component that is responsible for deploying applications, while the Data Flow Shell allows us to perform DSL commands needed for interacting with a server.
In the previous article, we used Spring Initilizr to set them both up as a Spring Boot Application.
After adding the @EnableDataFlowServer annotation to the serverβs main class and the @EnableDataFlowShell annotation to the shellβs main class respectively, they are ready to be launched by performing:
mvn spring-boot:run
The server will boot up on port 9393 and a shell will be ready to interact with it from the prompt.
You can refer to the previous article for the details on how to obtain and use a Local Data Flow Server and its shell client.
3. The Batch Application
As with the server and the shell, we can use Spring Initilizr to set up a root Spring Boot batch application.
After reaching the website, simply choose a Group, an Artifact name and select Cloud Task from the dependencies search box.
Once this is done, click on the Generate Project button to start downloading the Maven artifact.
The artifact comes preconfigured and with basic code. Letβs see how to edit it in order to build our batch application.
3.1. Maven Dependencies
First of all, letβs add a couple of Maven dependencies. As this is a batch application, we need to import libraries from the Spring Batch Project:
<dependency>
<groupId>org.springframework.boot</groupId>
<artifactId>spring-boot-starter-batch</artifactId>
</dependency>
Also, as the Spring Cloud Task uses a relational database to store results of an executed task, we need to add a dependency to an RDBMS driver:
<dependency>
<groupId>com.h2database</groupId>
<artifactId>h2</artifactId>
</dependency>
Weβve chosen to use the H2 in-memory database provided by Spring. This gives us a simple method of bootstrapping development. However, in a production environment, youβll want to configure your own DataSource.
Keep in mind that artifactsβ versions will be inherited from Spring Bootβs parent pom.xml file.
3.2. Main Class
The key point to enabling desired functionality would be to add the @EnableTask and @EnableBatchProcessing annotations to the Spring Bootβs main class. This class level annotation tells Spring Cloud Task to bootstrap everything:
@EnableTask
@EnableBatchProcessing
@SpringBootApplication
public class BatchJobApplication {
public static void main(String[] args) {
SpringApplication.run(BatchJobApplication.class, args);
}
}
3.3. Job Configuration
Lastly, letβs configure a job β in this case a simple print of a String to a log file:
@Configuration
public class JobConfiguration {
private static Log logger
= LogFactory.getLog(JobConfiguration.class);
@Autowired
public JobBuilderFactory jobBuilderFactory;
@Autowired
public StepBuilderFactory stepBuilderFactory;
@Bean
public Job job() {
return jobBuilderFactory.get("job")
.start(stepBuilderFactory.get("jobStep1")
.tasklet(new Tasklet() {
@Override
public RepeatStatus execute(StepContribution contribution,
ChunkContext chunkContext) throws Exception {
logger.info("Job was run");
return RepeatStatus.FINISHED;
}
}).build()).build();
}
}
Details on how to configure and define a job are outside the scope of this article. For more information, you can see our Introduction to Spring Batch article.
Finally, our application is ready. Letβs install it inside our local Maven repository. To do this cd into the projectβs root directory and issue the command:
mvn clean install
Now itβs time to put the application inside the Data Flow Server.
4. Registering the Application
To register the application within the App Registry we need to provide a unique name, an application type, and a URI that can be resolved to the app artifact.
Go to the Spring Cloud Data Flow Shell and issue the command from the prompt:
app register --name batch-job --type task
--uri maven://com.baeldung.spring.cloud:batch-job:jar:0.0.1-SNAPSHOT
5. Creating a Task
A task definition can be created using the command:
task create myjob --definition batch-job
This creates a new task with the name myjob pointing to the previously registeredbatch-job application .
A listing of the current task definitions can be obtained using the command:
task list
6. Launching a Task
To launch a task we can use the command:
task launch myjob
Once the task is launched the state of the task is stored in a relational DB. We can check the status of our task executions with the command:
task execution list
7. Reviewing the Result
In this example, the job simply prints a string in a log file. The log files are located within the directory displayed in the Data Flow Serverβs log output.
To see the result we can tail the log:
tail -f PATH_TO_LOG\spring-cloud-dataflow-2385233467298102321\myjob-1472827120414\myjob
[...] --- [main] o.s.batch.core.job.SimpleStepHandler: Executing step: [jobStep1]
[...] --- [main] o.b.spring.cloud.JobConfiguration: Job was run
[...] --- [main] o.s.b.c.l.support.SimpleJobLauncher:
Job: [SimpleJob: [name=job]] completed with the following parameters:
[{}] and the following status: [COMPLETED]
8. Conclusion
In this article, we have shown how to deal with batch processing through the use of Spring Cloud Data Flow.
