Internet Archive Bulk CLI Uploading

#1 Post by manyroads »

Hello all.

I am attempting to bulk upload a bunch of images (PDF files, 18 GB worth) to the Internet Archive. I think I am almost there, but my brain is officially in overload. Maybe a smart person can help. What follows is what I have achieved thus far. I think, for an old guy, it's "almost adequate"... but not quite "good enough".

Here's what I've done & where I'm stuck.

To get the internetarchive (ia) command environment functioning you need to:

Code:

$ sudo apt-get install python-pip python-pip-whl pypy-six
$ pip install --upgrade setuptools
$ sudo pip install internetarchive
Next, the Internet Archive command-line interface (ia) needs to be downloaded and verified:

Code:

$ curl -LOs https://archive.org/download/ia-pex/ia
$ chmod +x ia
$ ./ia help
The remaining instructions may be found here: https://archive.org/services/docs/api/i ... tml#upload

The challenge I'm having is building a functioning CSV spreadsheet to use for uploading. Everything else seems okay, but how would I know without a functioning CSV upload spreadsheet?
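
For reference, here is a minimal sketch of one way to build such a spreadsheet. It assumes, as I understand the upload documentation linked above, that `ia upload --spreadsheet` reads a CSV with an `identifier` column and a `file` column, one row per file. The function name `make_ia_csv`, the `./scans` directory, and the `my-existing-item` identifier are placeholders, and the sketch assumes the file names contain no commas, quotes, or newlines.

```shell
#!/bin/sh
# Hypothetical helper: print a CSV for `ia upload --spreadsheet`,
# one row per PDF found in a directory.
# Assumptions: every file goes into the same existing item, and
# file names contain no commas, quotes, or newlines.
make_ia_csv() {
    csv_dir=$1          # directory holding the PDF files
    csv_identifier=$2   # identifier of the existing item (placeholder)

    # Header row: the two columns this sketch relies on.
    printf 'identifier,file\n'

    for csv_file in "$csv_dir"/*.pdf; do
        # If the glob matched nothing, the literal pattern is left over; skip it.
        [ -e "$csv_file" ] || continue
        printf '%s,%s\n' "$csv_identifier" "$csv_file"
    done
}
```

Usage might look like `make_ia_csv ./scans my-existing-item > metadata.csv`, followed by `ia upload --spreadsheet=metadata.csv` once `ia configure` has stored your credentials. Further metadata columns (title, creator, and so on) could be added to the header and to each row in the same way.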

Is anyone able to help?
Pax vobiscum,
Mark Rabideau - ManyRoads Genealogy -or- eirenicon llc. (geeky stuff)
i3wm, bspwm, hlwm, dwm, spectrwm ~ Linux #449130
"For every complex problem there is an answer that is clear, simple, and wrong." -- H. L. Mencken

#2 Post by Paul.. »

Stevo is probably your best bet...just be patient...he is on the west coast.

Asus Prime X570-Pro | AMD Ryzen 7 3700X
16 Gig DDR4 3600 | Radeon RX 5600 XT Graphics
Samsung 860 500GB SSDs (2)

#3 Post by manyroads »

Paul.. wrote: Tue Jan 29, 2019 5:17 pm Stevo is probably your best bet...just be patient...he is on the west coast.
Thanks, @Paul.

#4 Post by Paul.. »

Are you trying to do bulk uploading? Or, are you trying another path because the web-based upload is too slow?

#5 Post by manyroads »

Paul.. wrote: Tue Jan 29, 2019 5:59 pm Are you trying to do bulk uploading? Or, are you trying another path because the web-based upload is too slow?
I'm trying to bulk upload some 35 files (about 14 GB in total) to a pre-existing folder. Doing that with their web-based upload would complete a year or two after my demise. :p Genealogist talk...

#6 Post by Stevo »

Paul.. wrote: Tue Jan 29, 2019 5:17 pm Stevo is probably your best bet...just be patient...he is on the west coast.
Umm...probably no help from me. I've just done the point-and-click web interface uploads, so you've gone way past me now.

#7 Post by Paul.. »

It's a shame you can't use rsync, like on sourceforge.net.

#8 Post by manyroads »

Stevo wrote: Tue Jan 29, 2019 6:05 pm Umm...probably no help from me. I've just done the point-and-click web interface uploads, so you've gone way past me now.
I guess it'll be Google time... I'll see if I can find a really big hammer and some duct tape to make it work.

#9 Post by skidoo »

Using pip is the Python equivalent of installing .deb files from PPAs, so let's not go there, eh.
You could git clone (or download via the web) that same "ia" command-line program from
https://github.com/jjjake/internetarchive
but I don't know offhand whether its requirements are available as Debian packages (package names may differ from the exact names of the Python libraries they contain).

I found this
https://internetarchive.readthedocs.org ... t/cli.html
which redirects to this
https://internetarchive.readthedocs.io/ ... t/cli.html
placeholder page which states "this page does not exist"
so I don't have much confidence the "ia" utility is, or ever was, ready for primetime...

#10 Post by manyroads »

skidoo wrote: Tue Jan 29, 2019 8:08 pm using pip is the python equivalent of installing .deb files from PPAs, so let's not go there, eh.
I found a workaround for my immediate challenge. I really had no problem with the ia install. I even found mention of others on Debian doing something very similar for other Python apps. Now that I'm past my urgent obstacle, I'll see what I can do.