SpringBatch

Sabin Karki

Published Aug 12, 2018

Architecture

Spring Batch is an open source framework for batch processing. It is a lightweight, comprehensive solution designed to enable the development of robust batch applications,which are often found in modern enterprise systems --wiki . Technically batch reads the data from the source, processed the read data according to the business requirements and finally writes the processed data to the respective destination. UseCase :: Claim process in the insurance domain.

System Requirement

1.SpringBoot 2.Java8 3.Any Preferred IDE

ResourceUrl

GET http://localhost:8085/h2 >> console of h2 database. GET http://localhost:8085/launchJob1>>To launch Job1 GET http:://localhost:8085/launchJob2>>To launch Job2 DELETE http://localhost:8085>>To delete all the inserted record

About Project

pom.xml

Project Structure

This is the example of the batch application which runs at 8085 . I have configured H2 database and JPA for persistence. The purpose of the application is to read csv file, process it and write to the H2 database via spring batch. In order to stop the default behaviour of the spring batch in the properties file I have added spring.batch.job.enabled=false which means I will launch the batch job manually via REST .

Enabling Batch processing using @EnableBatchProcessing

@EnableBatchProcessing
@SpringBootApplication
public class BatchSpringRunner {

	public static void main(String[] args) {
		SpringApplication.run(BatchSpringRunner.class, args);
	}
}

We need to create BatchConfig.java which is in our source code. Here Jobs are created by JobBuilderFactory and Steps are created from StepBuilderFactory . Every Step has three parts ItemReader ,ItemProcessor and ItemWriter as well as we can added listener as I have added in my case. In my case I have two jobs with bean qualifier name job1 and job2 .If we have multiple steps we have to use flow() and if only one step we can use start() as well.

Lets take Job1 for explanation

BatchConfig.java

...

@Autowired
private JobBuilderFactory jobs;

@Bean(name ="job1")
public Job job1(JobCompletionNotificationListener1 listner){
    return jobs.get("job1")
            .incrementer(new RunIdIncrementer())
            .listener(listner)
            .flow(step1()) //execute a step or sequence of steps
            .next(step2())
            .end()
            .build();
}
...

In job1 we are passing JobCompletionNotificationListener1 which extends JobExecutionListenerSupport . The method afterJob gets triggered after the completion of step. Note: Step has three parts Read,Process and Write. Incrementer is a id that we assign for every run and we are using the default in our case. job1 has multiple steps step1 and step2.

step1

BatchConfig.java

...

@Autowired
private StepBuilderFactory stepBuilderFactory;

@Autowired
private VechileProcessor vechileProcessor;

@Autowired
private VechileWriter vechileWriter;

@Bean
public Step step1(){
    return stepBuilderFactory.get("step1")
            .<Vechile,Vechile>chunk(10)
            .reader(reader())
            .processor(vechileProcessor)
            .writer(vechileWriter)
            .build();
}
...

In step we have batch chunk of size 10 and input/output of type vechile. We have a reader() method whose sole purpose is to read the vechile.csv file and convert to the vechile entity.

BatchConfig.java
...

@Bean
public FlatFileItemReader<Vechile> reader() {
    return new     FlatFileItemReaderBuilder<Vechile>().name("vechileItemReader").resource(new ClassPathResource("vechiles.csv"))
            .delimited().names(new String[] {"type","model","built" })
            .linesToSkip(1)  //skipping row one from csv file
            .fieldSetMapper(new BeanWrapperFieldSetMapper<Vechile>(){
                
                {
                    setTargetType(Vechile.class);
                }
            }).build();
...

We are using the FlatFileItemReader which reads the csv from the ClassPathResource. After reading the csv , item is converted to targetType.i.e Vechile.java . vechileProcessor is the intermediate operation in which read data is transformed according to business requirements. In our case

@Component
public class VechileProcessor implements ItemProcessor<Vechile, Vechile> {

private  static long id =0;

@Override
public Vechile process(Vechile vechile) throws Exception {
    
    if(Integer.parseInt(vechile.getBuilt())>1998){
        final String model = firstIndexCapital(vechile.getModel()).toString();
        vechile = new Vechile(++id,vechile.getType(),model,vechile.getBuilt());
    }
    
    return vechile;
}

public StringBuilder firstIndexCapital(String word){
    StringBuilder sb = new StringBuilder();
    sb.append(word.charAt(0)+"".toUpperCase());
    sb.append(word.subSequence(1, word.length()));
    return sb;
}

we are filtering out the vechile whose built is greater than 1998 and we change the First index to model to uppercase as well as added the id because if we see in the entity our id is primarykey and in our csv we don't have ids.

After the completion of the processor we have vechileProcessor.

vechileProcessor

@Componentpublic class VechileWriter implements ItemWriter<Vechile>{

@Autowired
private VechileRepository vechileRepository;

@Override
public void write(List<? extends Vechile> vechiles) throws Exception {
    this.vechileRepository.saveAll(vechiles);
    
}

Here we have injected vechileRepository and called the saveAll() to persist the list of vechiles. After vechileWriter done, JobCompletionNotificationListner1 is triggered.After the completion of step1, step2 get triggered and follow the same steps accordingly.

Launching Job

@RestController
public class HomeController {
private static final Logger log = LoggerFactory.getLogger(HomeController.class);

@Autowired
private JobLauncher jobLauncher;

@Qualifier("job1")
@Autowired
private Job job1;

@Qualifier("job2")
@Autowired
private Job job2;

@Autowired
private VehileService vechileService;


@GetMapping("/launchJob1")
public String kickOffJob() {

    try {
        
        JobParameters jobParameters = new JobParametersBuilder().addLong("time",System.currentTimeMillis()).toJobParameters();
        jobLauncher.run(job1,jobParameters);
    
    } catch (Exception e) {
        log.info(e.getMessage());
    }

    return "Done";

}
....

Before Job Launch

Here we have autowired JobLauncher. Whenever http://localhost:8085/launchJob1 is hit job1 get triggered. run(..) Start a job execution for the given Job and JobParameters.

SpringBatch

Sabin Karki

System Requirement

ResourceUrl

About Project

Launching Job

After Job Launch

References Various Internet Sources

Project Link

More articles by this author

Others also viewed

If You Build It, Send it Out

Integrating a LIMS with SOAP 1.1 Only Capability and an Instrument Using JSON Only Capability via Middleware

A Comparative Analysis of Batch Processing in Spring Boot and ORMB

Parity First, Modernize Second: Why “LLM-Only” Isn’t Enough for Mainframe Modernization

Spring-Boot / Spring-Integration

Portfolio Projects and the "Definition of Done"

System Integration Without APIs: Creative Approaches That Deliver

10 Best Practices for Developing Spring Boot APIs

Advanced Guide to Background Jobs in Rails: Sidekiq Batches & Custom Middleware

REDEFINES – How to Save Memory and Reuse the Same Storage Area?

Explore content categories

System Requirement

ResourceUrl

About Project

Launching Job

After Job Launch

References Various Internet Sources

Project Link

Migration from JDK8 to JDK11

Jul 1, 2019

Integration Testing Using Cucumber-Java

Jun 4, 2018

Others also viewed

If You Build It, Send it Out

Integrating a LIMS with SOAP 1.1 Only Capability and an Instrument Using JSON Only Capability via Middleware

A Comparative Analysis of Batch Processing in Spring Boot and ORMB

Parity First, Modernize Second: Why “LLM-Only” Isn’t Enough for Mainframe Modernization

Spring-Boot / Spring-Integration

Portfolio Projects and the "Definition of Done"

System Integration Without APIs: Creative Approaches That Deliver

10 Best Practices for Developing Spring Boot APIs

Advanced Guide to Background Jobs in Rails: Sidekiq Batches & Custom Middleware

REDEFINES – How to Save Memory and Reuse the Same Storage Area?

Explore content categories