如何使用Python爬取亚马逊卖家:2025年指南

Expert Network Defense Engineer
亚马逊是全球最大的电商网站,拥有无数产品和数百万客户。难怪零售商都想分一杯羹。
亚马逊店铺页面帮助这些零售商进行市场调研。它显示了亚马逊上店铺的详细信息,可以作为完善您自己店铺信息的指南。但是,很难追踪像亚马逊这样的购物网站上的所有细节。
这就是网络爬取API非常有效的解决方案所在——尽可能消除所有手动操作。无需再手动滚动并记下您认为应该记住的产品价格和店铺信息!使用强大的亚马逊爬取API,让爬取变得简单!
在本教程中,我们将学习如何使用带有Python的API来爬取亚马逊店铺详细信息。
让我们开始吧!
为什么我们要爬取亚马逊卖家信息?
- 竞争情报
- 市场和趋势分析
- 定价策略优化
- 产品研究和选择
- 客户洞察和情感分析
- 供应链和供应商研究
- 监控销售业绩
什么是Scrapeless以及为什么选择它来爬取亚马逊卖家信息?
Scrapeless是一个强大的API工具。它可以与Python无缝集成,旨在满足开发人员和非开发人员的需求。
它简化了爬取亚马逊卖家数据的整个过程,使用户可以轻松可靠地从亚马逊平台提取各种有效信息。
无论您是刚开始爬取数据的初学者,还是正在寻找高效爬取解决方案的经验丰富的开发人员,Scrapeless都提供简单而强大的功能来满足您的需求。
Scrapeless的优势:
🌐 1. 独特的IP轮换技术
当从电商平台(例如亚马逊)爬取数据时,频繁的请求通常会导致IP被封锁和爬取失败。Scrapeless内置的IP轮换技术会在每次请求时自动更改IP地址,有效防止被封锁。
🔒 2. 自动化验证码检测和绕过
亚马逊经常会触发验证码或反机器人挑战,尤其是在爬取大量亚马逊数据时。Scrapeless可以自动检测并绕过验证码,减少手动干预的需求。此功能可以显著提高您的亚马逊爬取Python项目的成功率,验证码解决率超过99%。
⚡ 3. 高效的爬取速度
速度是其核心优势之一。凭借优化的代码结构和并发爬取能力,Scrapeless可以显著提高您爬取亚马逊数据时的效率,使其成为使用亚马逊爬取Python工具的理想选择。
🚀 4. 持续爬取能力
对于需要长期稳定爬取数据的用户,Scrapeless提供了极佳的稳定性。它可以持续爬取数千条记录而不会出现常见的崩溃或失败,确保您的亚马逊爬虫Python可以长时间无缝运行。
🛠️ 5. 易于使用的API和可视化界面
Scrapeless提供了一个直观的API,使开发人员能够快速爬取亚马逊数据并检索所需的产品详细信息。对于非技术用户,它还提供了一个简单的界面和示例代码,以降低使用门槛。超过90%的用户高度评价Scrapeless的易用性,使其成为任何想要创建亚马逊Python爬虫而无需深入编程的理想选择。
加入我们的Scrapeless Discord群组并获得免费试用!
如何使用Scrapeless API爬取亚马逊卖家信息?

更多指南
根据上面产品详细信息返回的数据,您可以找到seller_url
字段,这是对应产品的商家字段。通过seller_url
,您可以直接访问相应商家的详细信息。
当然,如果您知道卖家ID,也可以自己构建URL,如下所示:
以 https://www.amazon.com/sp?seller=AESX3141EPI7X 为例
您只需要将“AESX3141EPI7X”更改为您要访问的卖家ID。
步骤1. 获取您的API密钥
登录Scrapeless后,系统会自动为您生成相应的ApiKey。您可以
- 点击“API密钥管理”
- 然后点击“查看API密钥”

步骤2. 将我们的代码集成到您的项目中
您只需要进行2个操作:
- 替换为您的目标卖家URL
- 输入您的API密钥
Python
import json
import requests
class Payload:
def __init__(self, actor, input_data):
self.actor = actor
self.input = input_data
def send_request():
host = "api.scrapeless.com"
url = f"https://{host}/api/v1/scraper/request"
token = "" ## 输入您的API令牌
headers = {
"x-api-token": token
}
input_data = {
"action": "seller",
"url": "https://www.amazon.com/sp?seller=AESX3141EPI7X" ## 替换为您的目标卖家的URL
}
payload = Payload("scraper.amazon", input_data)
json_payload = json.dumps(payload.__dict__)
response = requests.post(url, headers=headers, data=json_payload)
if response.status_code != 200:
print("Error:", response.status_code, response.text)
return
print("body", response.text)
if __name__ == "__main__":
send_request()
- 您可以在我们的API文档中找到更多语言。为了使上述项目更具体,原始Python代码是:
Python
import requests
import json
url = "https://api.scrapeless.com/api/v1/scraper/request"
payload = json.dumps({
"actor": "scraper.amazon",
"input": {
"url": "",
"action": "seller"
}
})
headers = {
'Content-Type': 'application/json'
}
response = requests.request("POST", url, headers=headers, data=payload)
print(response.text)
步骤3. 开始爬取并获取输出
以下是Scrapeless亚马逊爬取API返回的卖家详细信息:
JSON
{
"url": "https://www.amazon.com/sp?seller=AESX3141EPI7X",
"seller_id": "AESX3141EPI7X",
"seller_name": "XXX",
"description": "About SellerXXX is proud to offer you the best quality products with the best quality service. Customer satisfaction is our number #1 priority! If you have any questions or concerns about your order, please don't hesitate to contact us at: 1-844-637-1400 Our customer service hours are Monday thru Friday, 10 AM - 5 PM Eastern Time. Looking forward to hearing from you!",
"detailed_info": [
{
"title": "Business Name:",
"value": "ADN GLOBAL LLC"
},
{
"title": "Business Address:",
"value": "502 Jersey Ave,STE A,NEW BRUNSWICK,NJ,08901,US"
}
],
"feedbacks": [
{
"starts": "5 out of 5 stars",
"text": "good",
"date": "By Yenny albarracin on December 26, 2024."
},
{
"starts": "4 out of 5 stars",
"text": "Aurticulo en buen estado y muy eficiente en la entrega",
"date": "By Juan D. on December 26, 2024."
},
{
"starts": "4 out of 5 stars",
"text": "Good experience was received on time",
"date": "By Symon Harry on December 25, 2024."
},
{
"starts": "5 out of 5 stars",
"text": "It’s a gift",
"date": "By Patty T. on December 25, 2024."
},
{
"starts": "1 out of 5 stars",
"text": "Disappointed with service. Order in November and gift will not be here before Christmas. My child will be so disappointed.",
"date": "By Rosey M. on December 24, 2024."
}
],
"stars": "4.5 out of 5 stars",
"return_policy": "To get information about the Return and Refund policies that may apply, please refer to Amazon’s Return and Refund policy.To initiate a return, visit Amazon's Online Return Center to request a return authorization from the seller. For any issues with your return, if the product was shipped by the seller, you can get help here.",
"shipping_policies": "Unless noted otherwise in the ordering pipeline, XXX ships all items within two days of receiving an order. You will receive notification of any delay or cancellation of your order.",
"privacy_security": "Amazon knows that you care how information about you is used and shared, and we appreciate your trust that we will do so carefully and sensibly. By visiting Amazon.com, you are accepting the practices described in Amazon.com's Privacy Policy . In addition, we want you to be aware that Amazon.com will provide XXX with information related to your transactions involving their products (including, for example, your name, address, products you purchase, and transaction amount), and that such information will be subject to XXX's Privacy Policy.",
"privacy_policy": "XXX values the privacy of your personal data. For more information see Amazon.com's Privacy Policy .",
"tax_info": "Sales tax is not separately calculated and collected in connection with items ordered from XXX through the Amazon.com Site unless explicitly indicated as such in the ordering process. Items ordered from XXX may be subject to tax in certain states, based on the state to which the order is shipped. If an item is subject to sales tax, in accordance with state tax laws, the tax is generally calculated on the total selling price of each individual item, including shipping and handling charges, gift-wrap charges and other service charges, less any applicable discounts. If tax is separately calculated and collected in connection with items ordered from XXX through the Amazon.com Site, the tax amounts that appear during the ordering process are estimated - the actual taxes that will be charged to your credit card will be calculated at the time your order is processed and will appear in your order confirmation notification.",
"help_content": "For questions about a charge that has been made to your credit card, please contact Amazon. Questions about how to place an order? Search Amazon Help.",
"products_link": "https://www.amazon.com/s?ie=UTF8&marketplaceID=ATVPDKIKX0DER&me=AESX3141EPI7X",
"business_name__DUPLICATE": "XXX",
"business_address__DUPLICATE": "XXX",
"rating_positive": "90% positive",
"brands": "",
"feedbacks_percentages": {
"star_1": "7%",
"star_2": "2%",
"star_3": "2%",
"star_4": "11%",
"star_5": "79%"
},
"rating_count_m12": "1,143",
"rating_count_m3": "276",
"rating_count_lifetime": "21,128",
"rating_count_m1": "118",
"country": "US",
"email": "",
"timestamp": "2024-12-26"
}
Scrapeless控制面板:爬取亚马逊卖家的最简单方法
上述Python步骤对许多人来说可能比较麻烦。为了减轻企业爬取的负担,Scrapeless控制面板已经为您节省了麻烦的请求启动过程。您只需点击几下并进行简单的配置即可轻松爬取卖家信息。
现在继续滚动!
步骤1. 登录Scrapeless
步骤2. 点击“爬取API”并选择“亚马逊”进入亚马逊爬取页面。

步骤3. 复制目标卖家URL并将其输入框中。将“操作”切换为“卖家”,然后点击“开始爬取”按钮。

在工具页面上,您可以选择要爬取的数据类型:
- 卖家: 爬取卖家信息,包括卖家名称、评分、联系信息等。
- 产品: 爬取产品详细信息,例如标题、价格、评分、评论等。
- 关键词: 爬取与产品相关的关键词,以帮助您分析产品的SEO和市场趋势。
步骤4. 爬取完成后,您可以在右侧面板中查看爬取的数据。结果将以清晰的格式显示,便于分析。

如果您需要爬取其他产品,请点击继续,输入新的亚马逊链接并重复上述步骤。
总结
虽然有多种方法可以爬取亚马逊卖家页面,但是自己编写代码来实现可能会比较困难。您需要手动设置浏览器自动化才能使其工作,还需要从检索到的HTML中解析出字段。
是时候减轻所有负担,轻松爬取数据了!使用强大的Scrapeless亚马逊爬取API,实现简单、高效、准确、快速、稳定和安全的的数据爬取。
在Scrapeless,我们仅访问公开数据,同时严格遵守适用的法律、法规和网站隐私政策。 本博客内容仅用于演示目的,不涉及任何非法或侵权行为。 对于使用本博客或第三方链接的信息,我们不做任何保证,也不承担任何责任。 在参与任何抓取活动之前,请咨询您的法律顾问并查看目标网站的服务条款或获取必要的权限。