Portfolio

Real Estate Address Book Crawling

almaster institute

#Real Estate, #Web

Overview

At the request of Alma Mater Institute, IT7 completed a project to crawl real estate information nationwide. The project involved collecting real estate data from various websites, structuring it, and integrating it into Alma Mater Institute's database. The collected data includes information on real estate agents, property listings, and regional real estate prices. With this data in its database, Alma Mater Institute can base its decisions on more accurate and reliable information.

Project Objectives

The main objectives of this project are as follows:

  1. Collect Real Estate Information: Gather real estate data, including property listings, real estate agent information, and regional price trends, from across the country.
  2. Structure the Data: Convert the collected data to match the structure of Alma Mater Institute's database and maintain data consistency.
  3. Integrate the Database: Integrate the structured data into Alma Mater Institute's existing database to facilitate data analysis and visualization.
  4. Create a Website Portfolio: Publish the project results on IT7's website as a portfolio to showcase our technical capabilities and achievements.
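
As an illustration of objective 2, the following is a minimal sketch of mapping a loosely structured scraped entry onto a fixed record shape. The `AgentRecord` type, its three fields, and the helper names are hypothetical; the actual schema of Alma Mater Institute's database is not described here.

```python
from dataclasses import dataclass
import re


@dataclass(frozen=True)
class AgentRecord:
    """Hypothetical target shape for one real estate agent row."""
    name: str
    phone: str
    address: str


def normalize_phone(raw: str) -> str:
    """Keep digits only, so '02-123-4567' and '02 123 4567' compare equal."""
    return re.sub(r"\D", "", raw)


def to_record(raw: dict) -> AgentRecord:
    """Map a scraped dict onto the fixed record shape, trimming whitespace."""
    return AgentRecord(
        name=raw.get("name", "").strip(),
        phone=normalize_phone(raw.get("phone", "")),
        # Collapse runs of whitespace inside the address to single spaces.
        address=" ".join(raw.get("address", "").split()),
    )
```

Normalizing values at this stage means that later steps, such as duplicate detection, can compare fields directly.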

Technology Stack

The following technologies were utilized in this project:

  • HTML5: Used to create the structure and content of web pages.
  • JavaScript: Used to implement dynamic functions on the client side and to process data.
  • jQuery: A JavaScript library used to simplify HTML document traversal, event handling, animations, and AJAX interactions.
  • Python: Primarily used for web crawling and data processing. Libraries such as BeautifulSoup and Selenium were employed to extract necessary information from web pages.
  • MySQL: A database system used to store and manage the collected data.
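
To give a sense of what extracting information from a web page looks like, here is a small sketch. It uses Python's standard-library `html.parser` instead of BeautifulSoup so that it runs without third-party packages. The `<li class="agent">` markup and the `AgentListParser` and `extract_agents` names are assumptions made for illustration, not the markup of any site crawled in the project.

```python
from html.parser import HTMLParser


class AgentListParser(HTMLParser):
    """Collect the text of <li class="agent"> elements (hypothetical markup)."""

    def __init__(self):
        super().__init__()
        self.agents = []
        self._in_agent = False

    def handle_starttag(self, tag, attrs):
        # attrs is a list of (name, value) pairs for the opening tag.
        if tag == "li" and ("class", "agent") in attrs:
            self._in_agent = True
            self.agents.append("")

    def handle_endtag(self, tag):
        if tag == "li":
            self._in_agent = False

    def handle_data(self, data):
        # Accumulate text only while inside a matching list item.
        if self._in_agent:
            self.agents[-1] += data


def extract_agents(html: str) -> list[str]:
    """Return the trimmed text of every agent list item in the page."""
    parser = AgentListParser()
    parser.feed(html)
    parser.close()
    return [agent.strip() for agent in parser.agents]
```

With BeautifulSoup, the same extraction would typically be a single CSS selector query. Pages that load their content dynamically would first need to be rendered, which is the role Selenium played in this project.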

Work Process

  1. Requirements Analysis: Collaborated with Alma Mater Institute to define the types and scope of data needed. Designed the data format and database structure.
  2. Web Crawler Development: Developed scripts to crawl various real estate websites using Python. Utilized BeautifulSoup to parse HTML documents and Selenium to collect data from dynamically loaded pages.
  3. Data Collection and Cleaning: Cleaned the collected data through deduplication, format conversion, and outlier handling to ensure consistency and reliability.
  4. Database Integration: Stored the cleaned data in a MySQL database and merged it with the existing data. Applied validation checks to keep the integrated data consistent.
  5. Portfolio Creation: Organized the project's achievements and technical details into a portfolio and published it on IT7's website. The web page was built using HTML5, JavaScript, and jQuery to visually represent the key aspects of the project.
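
Steps 3 and 4 above can be sketched as follows. This is a simplified illustration under stated assumptions, not the project's actual code: it uses the standard-library `sqlite3` module as a stand-in for MySQL so the example is self-contained, and the `agents` table, its columns, and the `(name, phone)` duplicate key are hypothetical.

```python
import sqlite3


def deduplicate(rows):
    """Drop rows whose (name, phone) key was already seen, keeping the first."""
    seen = set()
    unique = []
    for row in rows:
        key = (row["name"], row["phone"])
        if key not in seen:
            seen.add(key)
            unique.append(row)
    return unique


def load_agents(conn, rows):
    """Insert rows, skipping any that collide with rows already stored.

    Returns the total number of rows in the table after loading.
    """
    conn.execute(
        "CREATE TABLE IF NOT EXISTS agents ("
        "name TEXT NOT NULL, phone TEXT NOT NULL, address TEXT, "
        "UNIQUE (name, phone))"
    )
    # The UNIQUE constraint plus INSERT OR IGNORE makes repeated loads safe.
    conn.executemany(
        "INSERT OR IGNORE INTO agents (name, phone, address) "
        "VALUES (:name, :phone, :address)",
        rows,
    )
    conn.commit()
    return conn.execute("SELECT COUNT(*) FROM agents").fetchone()[0]
```

In MySQL, comparable behavior is available through `INSERT IGNORE` or `INSERT ... ON DUPLICATE KEY UPDATE` on a table with a unique key. Enforcing uniqueness in the database as well as in the cleaning step guards against duplicates that come from merging with existing data.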

Results and Expected Benefits

Through this project, IT7 was able to systematically provide Alma Mater Institute with nationwide real estate information. This allows Alma Mater Institute to make more accurate and reliable data-driven decisions, ultimately aiding in real estate market analysis and strategy formulation.

Additionally, this project gave IT7 an opportunity to present its technical capabilities externally. The portfolio published on the website showcases our expertise and achievements, which we expect to help in acquiring new clients and project contracts.

Conclusion

Through this project, IT7 effectively crawled nationwide real estate information and successfully integrated it into Alma Mater Institute's database. We are delighted to promote our technical capabilities and achievements through the website portfolio. IT7 will continue to leverage cutting-edge technology to provide optimal solutions that meet the needs of our clients.

  • 2018.10 ~ 2018.10
  • almaster institute