Webscraping | Sololearn: Learn to code for FREE!
New course! Every coder should learn Generative AI!
Try a free lesson
+ 2

Webscraping

Hi there So for school i have to do these massive projects, but i always need the date and author of the website i use to get my information. Sometimes getting the date is hard, because it's not always on it. So i thought i would webscrape it. I would use the "inurl:" google operater, but BeautifulSoup can't request the url. For example: "https://www.sololearn.com/Discuss/New/" would work. inurl:https://www.sololearn.com/Discuss/New/ doesn't work. I've been searching the web for a while but didn't find anything. Do you have some ideas i can try? I would really appreciate it. Thank you in advance Jordi

11th Jan 2020, 2:17 PM
Jordi
Jordi - avatar
2 Answers
0
"inurl:https://sololearn.com/Discuss/New" is not a valid url. A valid url is in the following form <http/https>://<www/subdomain/none>.<domain>.<domain-extention>/Path like: https://example.com/example Anyway, what data you want to scrape from what URL? Maybe I can help you with that
11th Jan 2020, 2:49 PM
Sarthak
Sarthak - avatar
0
Well i was planning to make a program that outputs the date of a website. When the user enters the url, it ouputs the date. It sucks that the "inurl" doesn't work, cuz it would always be in the same place for every website. anyways, thank you. I will find something
11th Jan 2020, 2:58 PM
Jordi
Jordi - avatar