I Have A Question...And A Formula...Googlesheets


#1

Hi

My question is:

What is the most quickest way to acquire/gather URL’s in Googlesheets to begin using IMPORTXLM?

I have 2,000 URL’s that I want to use in Googlesheets, (which I will be integrating into Airtable) to import certain information from each website, they have to be in a particular sequence, but they are not one after the other. I started to bookmark the sites that I needed, in sequence, but that can be tedious. Then I noticed that I could drag the URL’s from one site, in a split screen on my iPad, into Googlesheets, but I wasn’t sure if I could use those same website links, that I dragged and dropped, when I use IMPORTXLM on my MacBook.

I found out about a formula:

=ArrayFormula(“https://www.canadianpostagestamps.ca/"&filter(IMPORTXML(“https://www.canadianpostagestamps.ca/year/”&A2,"//a/@href"),REGEXMATCH(IMPORTXML(“https://www.canadianpostagestamps.ca/year/”&A2,"//a/@href"),"/stamps/”)))

Could this formula.be tailer made for selecting individual URL’S, by date or month and year? I was using it, but I could not delete the records that I did not need, it would delete a bunch of records, even after just selecting random records. Then when I tried to put in a different year, I would just duplicate the URL’S in the same year (2001) as the sample given. from the formula.

Then I thought I would post this to find out if there is a quick and easy way to do this phase of the operation, before I use IMPORTXLM or am I off the mark.

Any help would be appreciated.

Thank you,
Mary


#2

I Hi

I have since found out that the website that I wanted to scrape images from, is locked, so I cannot use those images.

I do have a question. I am curious, if I could could make some small changes to the formula below to include specific dates, especially where it says “year” in the formula or a way to make copying URL’s from a website to GS a lot easier and quicker:

=ArrayFormula(“https://www.canadianpostagestamps.ca/"&filter(IMPORTXML(“https://www.canadianpostagestamps.ca/year/”&A2,"//a/@href"),REGEXMATCH(IMPORTXML(“https://www.canadianpostagestamps.ca/year/”&A2,"//a/@href"),"/stamps/”)))

Thank you,
MK