You cannot select more than 25 topics
Topics must start with a letter or number, can include dashes ('-') and can be up to 35 characters long.
37 lines
2.2 KiB
Plaintext
37 lines
2.2 KiB
Plaintext
|
|
# 2025-04-01 11:38:13.877891
|
|
+n
|
|
|
|
# 2025-04-01 11:43:19.707960
|
|
+Hello, I want to create a script that visits a wordpress site in wordpress.com, gets every article (the complete text) and then creates an epub with it. Before showing me any code, do you think it's a feasible project? What technologies would you use? What would be the best way to do it? Let's discuss the nuances before writing the script. I can tell you it's a wordpress blog hosted in wordpress.com. I tried using the rss feed, but it only provides the excerpt, not the full text
|
|
|
|
# 2025-04-01 11:50:37.050448
|
|
+Ok, some considerations: the blog does not need authorization at all, and we can scrape it directly no problem (but agree on being respectful with the rate limits, I'm in no rush and there are a lot of articles in that blog). The url is https://cronicasperiodisticas.wordpress.com/ and I think I'm ready to see what you can do. If you have any doubts about decisions to make, please ask me and don't assume anything beforehand.
|
|
|
|
# 2025-04-01 11:52:49.968887
|
|
+Please review history and get working
|
|
|
|
# 2025-04-01 11:53:40.660487
|
|
+Ok, some considerations: the blog does not need authorization at all, and we can scrape it directly no problem (but agree on being respectful with the rate limits, I'm in no rush and there are a lot of articles in that blog). The url is https://cronicasperiodisticas.wordpress.com/ and I think I'm ready to see what you can do. If you have any doubts about decisions to make, please ask me and don't assume anything beforehand.
|
|
|
|
# 2025-04-01 11:53:50.334347
|
|
+n
|
|
|
|
# 2025-04-01 11:55:34.785299
|
|
+I want to create a script that downloads the full contents of a wordpress site. I tried rss, but it only shows the excerpt and I want the full articles. When I have them all, I want to create an epub file that I can import to my eReader. Can you help me? The url of the blog is https://cronicasperiodisticas.wordpress.com/
|
|
|
|
# 2025-04-01 11:55:39.343821
|
|
+n
|
|
|
|
# 2025-04-01 11:56:18.995236
|
|
+I'm doing this in a Mac and pip don't work too well. Is it possible to do it using pipx?
|
|
|
|
# 2025-04-01 11:56:32.927149
|
|
+/run pipx install requests beautifulsoup4 ebooklib
|
|
|
|
# 2025-04-01 11:56:32.927482
|
|
+/run pipx install requests beautifulsoup4 ebooklib
|
|
|
|
# 2025-04-01 11:56:48.443978
|
|
+n
|