Traverse the web with Croft. This documentation gives you a clear guide on how you can use Croft & CSS selectors to get critical data on the internet! Just build an Impression and within seconds, your data is ready to exported as JSON πΉοΈ
Similar to a thumb impression that highlight unique details, Croftβs Impressions are configurations used to identify details that you want to extract from a website. These configurations consists of:
CSS Selectors of Content
The following is a CSS selector to get the <a>
tag embedded within <p>
tag with a class of description
p.description > a
HTML Attributes of Content
These can include href
in the following tag
<a href="<https://croft.so>"> Hello World </a>
To start building an Impression, navigate to Impressions page and click the β+β
icon, then, give your Impression a title. Click βcreateβ
Next, add in the URL of the site you want to scrape. In this case, we will be using Books To Scrape to get book thumbnails.
Within the home page, we can use developer tools to inspect the class of book cover images. We can see that the class name of the image tag is thumbnail
As a CSS selector, it will look like this:
img.thumbnail