Job spider: automatic vacancies scraping from Employer
websites / ATSes and bulk posting to your job board.
Web spider: any info grabbing from online sources

Posts Tagged ‘job parsing’

Job Scraping Example

Basic employer job search form usage, vacancy list spidering, parsing and XML generation example:

 

1. Job search form and list of jobs to spider:

- scrape all jobs or specify search criteria
- schedule for daily or weekly spidering
- filter out by desired / non-desired keywords

list of jobs

 

2. Job page downloading instructions abstract (Job Spider tool):
Job scrape

 

3. Example of a job advert scraped:
Fields highlighted will be extracted as per instructions below.

Job content

 

4. Source of the job to parse: 
HTML and Javascript tags are used to identify job content.

job html

 

5. Job Spider configuration instructions for parsing:
Regular expressions are used for flexible content extraction from HTML source.

Parsing rules

 

6. Resulting XML to be auto-posted to your job board interface:
Match XML file to your job board fields for correct posting.

Job XML

Redundant HTML Tags Removal From Content

It is not a rare occasion for the jobs downloaded from Employer website to hold redundant HTML tags like <b>, <br>, <font>, etc. For sometimes jobs are posted by copy-pasting from MS Word to WYSIWYGs…

 

We have added a tag clean up feature to Job Spider to solve the issue so excessive  formatting does not compromise job descriptions, page titles and headings  presentation on your job board.

Selective Job Posting: Filtering Results

Jobs spidered from source websites are not alway a perfect fit for your job board niche. SpiderMount Job Spider solves the issue by adding must have / must not have filtering criteria for job parsing. So not all of the jobs downloaded from employer site would get to your jobs database.
 

Any job field parsed can be automatically checked so only relevant jobs are then posted to your job board.
  

Example: filtering criteria configuration for job title 


Home   Products   Demo   Pricing   About Us   Contacts   Blog  
© Copyright 2008. All rights reserved
Aspen Technology Labs