Job spider: automatic vacancies scraping from Employer
websites / ATSes and bulk posting to your job board.
Web spider: any info grabbing from online sources

Posts Tagged ‘html tags’

Job Scraping Example

Basic employer job search form usage, vacancy list spidering, parsing and XML generation example:

 

1. Job search form and list of jobs to spider:

- scrape all jobs or specify search criteria
- schedule for daily or weekly spidering
- filter out by desired / non-desired keywords

list of jobs

 

2. Job page downloading instructions abstract (Job Spider tool):
Job scrape

 

3. Example of a job advert scraped:
Fields highlighted will be extracted as per instructions below.

Job content

 

4. Source of the job to parse: 
HTML and Javascript tags are used to identify job content.

job html

 

5. Job Spider configuration instructions for parsing:
Regular expressions are used for flexible content extraction from HTML source.

Parsing rules

 

6. Resulting XML to be auto-posted to your job board interface:
Match XML file to your job board fields for correct posting.

Job XML

Redundant HTML Tags Removal From Content

It is not a rare occasion for the jobs downloaded from Employer website to hold redundant HTML tags like <b>, <br>, <font>, etc. For sometimes jobs are posted by copy-pasting from MS Word to WYSIWYGs…

 

We have added a tag clean up feature to Job Spider to solve the issue so excessive  formatting does not compromise job descriptions, page titles and headings  presentation on your job board.


Home   Products   Demo   Pricing   About Us   Contacts   Blog  
© Copyright 2008. All rights reserved
Aspen Technology Labs