How to Download All the Presentations in a Specific SlideShare Channel
Dec 15, 2019
In this tutorial, we will cover how to download every presentation from a SlideShare channel in one go.
First, use Linux as your environment. If you are on Windows, you can use VirtualBox to install Ubuntu or any other Linux distribution.
Clone the GitHub repo using git
git clone https://github.com/agungw132/slideshare-downloader
Install the required dependencies
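# This Python code uses both Python 2 and Python 3
# Run these from inside the cloned slideshare-downloader directory, where the requirements files are expected to be
pip2 install -r requirements_py2.txt # Python 2
pip3 install -r requirements_py3.txt # Python 3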
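Before installing, it can help to confirm that the tools used in this tutorial are on your PATH. The following is a minimal sketch, not part of the repository; the helper name `missing_tools` is hypothetical.

```shell
# Hypothetical helper (not part of slideshare-downloader):
# prints every command name given as an argument that is not found on PATH.
missing_tools() {
  local tool
  for tool in "$@"; do
    if ! command -v "$tool" >/dev/null 2>&1; then
      printf '%s\n' "$tool"
    fi
  done
}

# Example call, checking the commands this tutorial relies on:
#   missing_tools git python python3 pip2 pip3
```

If the call prints nothing, every listed command was found. Any name it prints needs to be installed before continuing.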
Download all the links in a specific channel
Go to the channel (e.g., https://www.slideshare.net/NVIDIA/presentations for the NVIDIA channel). Note how many pages the channel contains (e.g., NVIDIA has 12 pages). Then download all the links:
# Change the directory to slideshare-downloader
cd ~/slideshare-downloader
# Create a variable "name" that holds the name of the channel (here, NVIDIA)
name="NVIDIA"
# If this is your first download, skip this step. Otherwise, remove the list.txt left over from the last time you downloaded from a channel
rm list.txt
# Change the upper bound of the range (12 here) to the number of pages the channel has
for j in {1..12}; do python extractpage2.py https://www.slideshare.net/$name/presentations/$j >> list.txt; done
# Create the download list
cat list.txt | grep "/$name/" | grep -v "/presentations" | grep -v "/clipboards" | grep -v "/documents" | grep -v "/videos" | grep -v "/infographics" | grep -v "/followers" > list2.txt
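The chain of `grep -v` calls above can also be written as a single extended-regex filter. This is a sketch under assumptions: the function name `filter_links` is hypothetical, and the `sort -u` step is an addition that removes duplicate links (and reorders them), which the original pipeline does not do.

```shell
# Hypothetical helper: reads links on stdin, keeps those under /<channel>/,
# drops the channel's navigation pages, and removes duplicate lines.
filter_links() {
  local channel="$1"
  grep "/$channel/" |
    grep -Ev '/(presentations|clipboards|documents|videos|infographics|followers)' |
    sort -u
}

# Example call, equivalent in intent to the pipeline above:
#   filter_links "$name" < list.txt > list2.txt
```

Removing duplicates may be useful if the same presentation is linked more than once on a channel page, since each line in list2.txt triggers one download.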
Download the Slides
# In this step, the Python program downloads the slides as images and then converts them to PDF
for i in `cat list2.txt`; do python3 convertpdf.py https://www.slideshare.net$i; done
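For long channels, a loop that records failures instead of stopping silently can be easier to resume. The following is a sketch only: the helpers `to_urls` and `download_all` and the file name `failed.txt` are hypothetical, and it assumes `convertpdf.py` exits with a non-zero status when a download fails, which is not confirmed here.

```shell
# Hypothetical helper: turns each non-empty path on stdin into a full SlideShare URL.
to_urls() {
  local path
  while IFS= read -r path; do
    [ -n "$path" ] || continue
    printf 'https://www.slideshare.net%s\n' "$path"
  done
}

# Hypothetical helper: converts every presentation listed in the file "$1".
# URLs whose conversion fails are appended to failed.txt (assumes that
# convertpdf.py signals failure through its exit status).
download_all() {
  local url
  to_urls < "$1" | while IFS= read -r url; do
    # </dev/null keeps the Python process from consuming the URL list on stdin
    python3 convertpdf.py "$url" </dev/null || printf '%s\n' "$url" >> failed.txt
  done
}

# Example call:
#   download_all list2.txt
```

Reading the list line by line with `read -r` also avoids the word splitting that a backtick `cat` loop applies to its input.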
If you want to download only one file
If you need only one file, e.g., https://www.slideshare.net/NVIDIA/nvidia-developer-program-overview-152503775, pass its URL to convertpdf.py directly:
# Just pass the URL
python3 convertpdf.py https://www.slideshare.net/NVIDIA/nvidia-developer-program-overview-152503775