Java Developer with Web Crawler Experience Job at Axiom Software Solutions Limited, Austin, TX

  • Axiom Software Solutions Limited
  • Austin, TX

Job Description

Role: Java Developer with Web Crawler Experience

Location: Austin, TX (Hybrid)

Responsibilities:

1. Web Crawler Development: Design and implement efficient and scalable web crawlers in Java to collect data from various online sources.

2. Data Extraction: Develop and maintain systems for structured data extraction, handling various data formats (HTML, JSON, XML, etc.).

3. Data Storage and Processing: Design data storage and processing pipelines, ensuring extracted data is clean, structured, and easily accessible.

4. Performance Optimization: Optimize web crawling processes for speed, efficiency, and accuracy, while ensuring minimal impact on source websites.

5. Error Handling and Logging: Implement error-handling mechanisms and logging systems to detect and resolve issues during crawling operations.

6. Data Integrity and Compliance: Ensure data collection practices are ethical, legal, and compliant with relevant regulations (e.g., robots.txt, copyright laws).

Requirements:

Proficiency in Java and experience with Java-based web scraping libraries (e.g., Jsoup, Apache

Knowledge of web crawling frameworks and tools, such as Scrapy, Selenium, or Puppeteer.

Strong understanding of HTML, CSS, JavaScript, and web data structures.

Familiarity with data parsing and handling techniques for JSON, XML, and other common formats.

Experience with database technologies (SQL, NoSQL) to store and manage scraped data.

Knowledge of protocols, headers, proxies, and load handling.
