PHP Classes

Puller: Scrape product data pages from e-commerce sites

Recommend this page to a friend!
  Info   View files View files (15)   DownloadInstall with Composer Download .zip   Reputation   Support forum   Blog    
Last Updated Ratings Unique User Downloads Download Rankings
2024-01-09 (4 days ago) RSS 2.0 feedNot enough user ratingsTotal: 894 All time: 3,946 This week: 175Up
Version License PHP version Categories
puller 0.1.11GNU Lesser Genera...5.3PHP 5, E-Commerce, Web services
Description 

Author

This package can scrape product data pages from e-commerce sites.

It provides a base class that can retrieve pages from Web sites and extract information from them.

Several implementation classes are provided to retrieve product pages given the product identifier.

Those classes extract relevant product information like the product price, description and picture from certain e-commerce sites.

Currently it provides classes for extracting product information from several Brazilian e-commerce sites like Americanas, NetShoes, Saraiva and Submarino.

Innovation Award
PHP Programming Innovation award winner
June 2013
Winner


Prize: One subscription to the PDF edition of the PHP Architect magazine
Many sites publish listings of products available in different stores.

The information of the products and the prices can be obtained from the stores if they provide it, or scraped directly from the pages of the sites.

This class provides a general solution for scraping product information scraped from the store Web sites.

Different driver classes are provides to scrape information from specific store sites.

Manuel Lemos
Picture of Evaldo Barbosa
  Performance   Level  
Innovation award
Innovation award
Nominee: 3x

Winner: 1x

 

Details

pp == You will get all needed information about a product in brazilian e-commerce sites. Simple to use, PP is the way to collect data about a product in brazilian e-commerce including price, description and picture. See the code of tests. To run the test: $ bin/phpunit -c tests/phpunit.xml --group=Info Example: ```php <? /** * Getting information from products */ use Puller\Target\SubmarinoProductInfo; //http://www.submarino.com.br/produto/111970051 $productid = 111970051; $p = new SubmarinoProductInfo( $productid ); echo $p->productId, "\n", $p->productName, "\n"; print_r( $p->productTable ); $p = new NetShoesProductInfo( '094-0460-014-03' ); echo $p->productId, "\n", $p->productName, "\n"; print_r( $p->productTable ); $p = new PontoFrioProductInfo( 'TelefoneseCelulares/Smartphones/Celular-Desbloqueado-Motorola-RAZR-i-Preto-com-Processador-Intel-de-2-GHz-Tela-de-4-3’’-Android-4-0-Camera-8MP-Wi-Fi-3G-NFC-GPS-e-Bluetooth-1748861.html' ); echo $p->productId, "\n", $p->productName, "\n"; print_r( $p->productTable ); </pre> ``` ## Important > *This software now is under LPGLv3. Se this:*<br/> > *http://www.gnu.org/copyleft/lesser.txt* > Winner - June 2013 > <img src="http://www.phpclasses.org/award/innovation/winner.png"/><br/> > http://www.phpclasses.org/package/8077-PHP-Scrape-product-data-pages-from-e-commerce-sites.html

  Files folder image Files  
File Role Description
Files folder imagesrc (2 directories)
Files folder imagetests (2 files, 1 directory)
Accessible without login Plain text file composer.json Data Auxiliary data
Accessible without login Plain text file LICENCE Data Auxiliary data
Accessible without login Plain text file README.md Data Auxiliary data

 Version Control Unique User Downloads Download Rankings  
 100%
Total:894
This week:0
All time:3,946
This week:175Up