How to Scrape Data from Any Website | Scribe

How to Scrape Data from Any Website

  • Scott Colenutt |
  • 0 step |
  • 3 minutes
  • BrowseBrowse
Sign up for an account with [Browse AI](https://www.browse.ai/?utm_source=100daysofnocode&utm_medium=partner&utm_campaign=q3) and follow the onboarding instructions.
Inside the dashboard, select "**Build New Robot**"
Select the "**Extract Structured Data**" option.
When invited to create your Robot, click the "**Origin URL**" field and enter: <https://www.notion.so/careers>
Click "**Start Training Robot**"
Click "**Use Robot Studio**"
Select the option "**Capture Text**" followed by "**From list**" from the right-hand menu.
Scroll down to the Customer Experience section and **highlight the List Items as shown below.**\ \ When you are web scraping, you'll often be exporting data into spreadsheets, and so it can help to think of list items as rows in a spreadsheet. \ \ In this example, we want to capture the data for each role and output this data in a row. \ \ Highlight the List Items as shown in the screenshot below.
You'll now be prompted to select the items you want to scrape (or extract).\ \ Start by selecting the job title.
Next, select the location from the first job listing.
Finally, select the outer border of the first job listing, and select "**Link**" from the pop-up that appears.
Click "**Confirm**" in the right-hand menu.
You'll now be invited to label the elements you've selected, in order of how you selected them.\ \ In the first pop-up box that appears, label the item "**Job Role**" and click the✅
Next, label the location element with "**Location**" and click ✅
Finally, label the link with "**Job URL**" and click ✅
**Give your extracted data list a name**. In this example we'll use "Notion CX Roles"
Select "**10**" as the maximum number of rows you want to extract.
Click "**Select Pagination Setting**"
In the right-hand menu, select "**No more items to load**"
Click "**Save Captured List**"
0 Selected
This Scribe is in tip-top shape!Leave feedback if there are any issues with this Scribe