scraping a web site

classic Classic list List threaded Threaded
28 messages Options
12
Reply | Threaded
Open this post in threaded view
|

scraping a web site

Nick Thompson

Dear Phellow Phriammers,

 

I am in the uncomfortable position of being bound by threads of steel to Earthlink.  Many, MANY, years I go I started a website on Earthlink, {http://home.earthlink.net/~nickthompson/naturaldesigns/

}, and put a lot of my writing, and some commentary up on it.  The website creation and editing medium (trellix) was pretty good for its time, and there are many ways that I find the site quite satisfying.  But gradually Earthlink has withdrawn its support, and now I am not sure I could get in to edit or change it.  Meantime, Research Gate has gotten started, and provides a somewhat better place to meet the world and archive my stuff.  And also, having the site on earthlink binds me to them and their 22 dollar a month fee.  So. …

 

I am wondering if there is a way (or a service that would) scrape the website and, possibly, dump it into a new and more reliable, more website creation medium?  Please, ambulatory knowledge only.  I don’t want a people doing deep searches to answer this  question . 

 

Thanks, as always .

 

Nick

 

Nicholas S. Thompson

Emeritus Professor of Psychology and Biology

Clark University

http://home.earthlink.net/~nickthompson/naturaldesigns/

 


============================================================
FRIAM Applied Complexity Group listserv
Meets Fridays 9a-11:30 at cafe at St. John's College
to unsubscribe http://redfish.com/mailman/listinfo/friam_redfish.com
FRIAM-COMIC http://friam-comic.blogspot.com/ by Dr. Strangelove
Reply | Threaded
Open this post in threaded view
|

Re: scraping a web site

Tom Johnson
Nick;
You might try installing Firefox, if you don't already use it, and go here to add-on DownThemAll.  I recall that you can set how many layers deep you want to go.  Of course if you get ALL your content you will have to figure out where and how you want to repost it.


Tom

On Jan 4, 2017 12:50 PM, "Nick Thompson" <[hidden email]> wrote:

Dear Phellow Phriammers,

 

I am in the uncomfortable position of being bound by threads of steel to Earthlink.  Many, MANY, years I go I started a website on Earthlink, {http://home.earthlink.net/~nickthompson/naturaldesigns/

}, and put a lot of my writing, and some commentary up on it.  The website creation and editing medium (trellix) was pretty good for its time, and there are many ways that I find the site quite satisfying.  But gradually Earthlink has withdrawn its support, and now I am not sure I could get in to edit or change it.  Meantime, Research Gate has gotten started, and provides a somewhat better place to meet the world and archive my stuff.  And also, having the site on earthlink binds me to them and their 22 dollar a month fee.  So. …

 

I am wondering if there is a way (or a service that would) scrape the website and, possibly, dump it into a new and more reliable, more website creation medium?  Please, ambulatory knowledge only.  I don’t want a people doing deep searches to answer this  question . 

 

Thanks, as always .

 

Nick

 

Nicholas S. Thompson

Emeritus Professor of Psychology and Biology

Clark University

http://home.earthlink.net/~nickthompson/naturaldesigns/

 


============================================================
FRIAM Applied Complexity Group listserv
Meets Fridays 9a-11:30 at cafe at St. John's College
to unsubscribe http://redfish.com/mailman/listinfo/friam_redfish.com
FRIAM-COMIC http://friam-comic.blogspot.com/ by Dr. Strangelove

============================================================
FRIAM Applied Complexity Group listserv
Meets Fridays 9a-11:30 at cafe at St. John's College
to unsubscribe http://redfish.com/mailman/listinfo/friam_redfish.com
FRIAM-COMIC http://friam-comic.blogspot.com/ by Dr. Strangelove
Reply | Threaded
Open this post in threaded view
|

Re: scraping a web site

Marcus G. Daniels

Once you’ve got all the files (like below) Microsoft Word can import HTML files.   Editors designed to HTML editing (e.g. Kompozer) will often have a “Open from web” option such that you can just type the URL.   If you really want systematic scraping, look at libraries like Beautifulsoup (Python based), but that will involve some programming.

 

From: Friam [mailto:[hidden email]] On Behalf Of Tom Johnson
Sent: Tuesday, January 03, 2017 10:39 PM
To: Friam@redfish. com <[hidden email]>
Subject: Re: [FRIAM] scraping a web site

 

Nick;

You might try installing Firefox, if you don't already use it, and go here to add-on DownThemAll.  I recall that you can set how many layers deep you want to go.  Of course if you get ALL your content you will have to figure out where and how you want to repost it.

 

 

Tom

 

On Jan 4, 2017 12:50 PM, "Nick Thompson" <[hidden email]> wrote:

Dear Phellow Phriammers,

 

I am in the uncomfortable position of being bound by threads of steel to Earthlink.  Many, MANY, years I go I started a website on Earthlink, {http://home.earthlink.net/~nickthompson/naturaldesigns/

}, and put a lot of my writing, and some commentary up on it.  The website creation and editing medium (trellix) was pretty good for its time, and there are many ways that I find the site quite satisfying.  But gradually Earthlink has withdrawn its support, and now I am not sure I could get in to edit or change it.  Meantime, Research Gate has gotten started, and provides a somewhat better place to meet the world and archive my stuff.  And also, having the site on earthlink binds me to them and their 22 dollar a month fee.  So. …

 

I am wondering if there is a way (or a service that would) scrape the website and, possibly, dump it into a new and more reliable, more website creation medium?  Please, ambulatory knowledge only.  I don’t want a people doing deep searches to answer this  question . 

 

Thanks, as always .

 

Nick

 

Nicholas S. Thompson

Emeritus Professor of Psychology and Biology

Clark University

http://home.earthlink.net/~nickthompson/naturaldesigns/

 


============================================================
FRIAM Applied Complexity Group listserv
Meets Fridays 9a-11:30 at cafe at St. John's College
to unsubscribe http://redfish.com/mailman/listinfo/friam_redfish.com
FRIAM-COMIC http://friam-comic.blogspot.com/ by Dr. Strangelove


============================================================
FRIAM Applied Complexity Group listserv
Meets Fridays 9a-11:30 at cafe at St. John's College
to unsubscribe http://redfish.com/mailman/listinfo/friam_redfish.com
FRIAM-COMIC http://friam-comic.blogspot.com/ by Dr. Strangelove
Reply | Threaded
Open this post in threaded view
|

Re: scraping a web site

Robert J. Cordingley
In reply to this post by Nick Thompson

Hi Nick

Your old Earthlink site seems to comprise just about ten 'pages' of content, with many of those pages (Published Works) listing many bibliographic citations, each with a link to an image and further link to a pdf document. Grabbing all the content manually is perhaps tedious but doable. Saving all the pages as HTML is also doable but don't see a lot of point in that. Populating your Research Gate website should be possible too with in browser Copy and Paste - but I'm not familiar with RG - as should any other website builder, Wix, Squarespace, WordPress as well as hosting company  website builders. I don't know of an automated system but the Internet Archive must have something and already has multiple captures of past versions of your site - see https://web.archive.org/web/20151206005021/http://home.earthlink.net/~nickthompson/naturaldesigns/.

I think what you're really looking for is a web/content migration tool more so than web scraping tools which tend to be focused on capturing specific data, say contact information. Vamosa seems to offer a service that should do exactly what you want, see http://www.vamosa.com/vamosa-content-migrator-c124 but suspect that's aimed at large corporate clients. I have no experience with them. Googling 'website migration tools' produces lots of results - some questionable.

Hope this helps.

Thanks, Robert


On 1/3/17 9:49 PM, Nick Thompson wrote:

Dear Phellow Phriammers,

 

I am in the uncomfortable position of being bound by threads of steel to Earthlink.  Many, MANY, years I go I started a website on Earthlink, {http://home.earthlink.net/~nickthompson/naturaldesigns/

}, and put a lot of my writing, and some commentary up on it.  The website creation and editing medium (trellix) was pretty good for its time, and there are many ways that I find the site quite satisfying.  But gradually Earthlink has withdrawn its support, and now I am not sure I could get in to edit or change it.  Meantime, Research Gate has gotten started, and provides a somewhat better place to meet the world and archive my stuff.  And also, having the site on earthlink binds me to them and their 22 dollar a month fee.  So. …

 

I am wondering if there is a way (or a service that would) scrape the website and, possibly, dump it into a new and more reliable, more website creation medium?  Please, ambulatory knowledge only.  I don’t want a people doing deep searches to answer this  question . 

 

Thanks, as always .

 

Nick

 

Nicholas S. Thompson

Emeritus Professor of Psychology and Biology

Clark University

http://home.earthlink.net/~nickthompson/naturaldesigns/

 



============================================================
FRIAM Applied Complexity Group listserv
Meets Fridays 9a-11:30 at cafe at St. John's College
to unsubscribe http://redfish.com/mailman/listinfo/friam_redfish.com
FRIAM-COMIC http://friam-comic.blogspot.com/ by Dr. Strangelove

-- 
Cirrillian 
Web Design & Development
Santa Fe, NM
http://cirrillian.com
281-989-6272 (cell)
Member Design Corps of Santa Fe

============================================================
FRIAM Applied Complexity Group listserv
Meets Fridays 9a-11:30 at cafe at St. John's College
to unsubscribe http://redfish.com/mailman/listinfo/friam_redfish.com
FRIAM-COMIC http://friam-comic.blogspot.com/ by Dr. Strangelove
Reply | Threaded
Open this post in threaded view
|

Re: scraping a web site

Nick Thompson

Dear Robert, Tom, and Marcus,

 

I am not sure how I would survive in this complicated world without this ability to ask a quick question of friam and get a quick answer.  The problem I so often face is WHAT QUESTION to ask the web, when I plunge into it.  I had gotten seduced by the dramatic metaphor of “scrape”; indeed, “migration” is a lot closer to what I am looking for.  These tips will help a lot and I will investigate them. 

 

Your mention of a web archive brought to mind another thought.  Years ago, I did up a website for the “City University of Santa Fe” which I thought was pretty nifty.  However, I was the only one who thought it was nifty, so in time even I lost interest.  And then I forgot to pay my fee to the hosting service, and they forgot to remind me, and I lost the site’s url to some outfit in Indiana.  I assumed I had lost the data too, but your email suggests the possibility that it still lives some where. 

 

Many, many thanks.

 

Nick

 

Nicholas S. Thompson

Emeritus Professor of Psychology and Biology

Clark University

http://home.earthlink.net/~nickthompson/naturaldesigns/

 

From: Friam [mailto:[hidden email]] On Behalf Of Robert J. Cordingley
Sent: Wednesday, January 04, 2017 12:00 AM
To: The Friday Morning Applied Complexity Coffee Group <[hidden email]>
Subject: Re: [FRIAM] scraping a web site

 

Hi Nick

Your old Earthlink site seems to comprise just about ten 'pages' of content, with many of those pages (Published Works) listing many bibliographic citations, each with a link to an image and further link to a pdf document. Grabbing all the content manually is perhaps tedious but doable. Saving all the pages as HTML is also doable but don't see a lot of point in that. Populating your Research Gate website should be possible too with in browser Copy and Paste - but I'm not familiar with RG - as should any other website builder, Wix, Squarespace, WordPress as well as hosting company  website builders. I don't know of an automated system but the Internet Archive must have something and already has multiple captures of past versions of your site - see https://web.archive.org/web/20151206005021/http://home.earthlink.net/~nickthompson/naturaldesigns/.

I think what you're really looking for is a web/content migration tool more so than web scraping tools which tend to be focused on capturing specific data, say contact information. Vamosa seems to offer a service that should do exactly what you want, see http://www.vamosa.com/vamosa-content-migrator-c124 but suspect that's aimed at large corporate clients. I have no experience with them. Googling 'website migration tools' produces lots of results - some questionable.

Hope this helps.

Thanks, Robert

 

On 1/3/17 9:49 PM, Nick Thompson wrote:

Dear Phellow Phriammers,

 

I am in the uncomfortable position of being bound by threads of steel to Earthlink.  Many, MANY, years I go I started a website on Earthlink, {http://home.earthlink.net/~nickthompson/naturaldesigns/

}, and put a lot of my writing, and some commentary up on it.  The website creation and editing medium (trellix) was pretty good for its time, and there are many ways that I find the site quite satisfying.  But gradually Earthlink has withdrawn its support, and now I am not sure I could get in to edit or change it.  Meantime, Research Gate has gotten started, and provides a somewhat better place to meet the world and archive my stuff.  And also, having the site on earthlink binds me to them and their 22 dollar a month fee.  So. …

 

I am wondering if there is a way (or a service that would) scrape the website and, possibly, dump it into a new and more reliable, more website creation medium?  Please, ambulatory knowledge only.  I don’t want a people doing deep searches to answer this  question . 

 

Thanks, as always .

 

Nick

 

Nicholas S. Thompson

Emeritus Professor of Psychology and Biology

Clark University

http://home.earthlink.net/~nickthompson/naturaldesigns/

 




============================================================
FRIAM Applied Complexity Group listserv
Meets Fridays 9a-11:30 at cafe at St. John's College
to unsubscribe http://redfish.com/mailman/listinfo/friam_redfish.com
FRIAM-COMIC http://friam-comic.blogspot.com/ by Dr. Strangelove



-- 
Cirrillian 
Web Design & Development
Santa Fe, NM
http://cirrillian.com
281-989-6272 (cell)
Member Design Corps of Santa Fe

============================================================
FRIAM Applied Complexity Group listserv
Meets Fridays 9a-11:30 at cafe at St. John's College
to unsubscribe http://redfish.com/mailman/listinfo/friam_redfish.com
FRIAM-COMIC http://friam-comic.blogspot.com/ by Dr. Strangelove
Reply | Threaded
Open this post in threaded view
|

Re: scraping a web site

Barry MacKichan
Check with the NSA ;-)

--Barry


On 4 Jan 2017, at 0:20, Nick Thompson wrote:

> I assumed I had lost the data too, but your email suggests the
> possibility that it still lives some where.

============================================================
FRIAM Applied Complexity Group listserv
Meets Fridays 9a-11:30 at cafe at St. John's College
to unsubscribe http://redfish.com/mailman/listinfo/friam_redfish.com
FRIAM-COMIC http://friam-comic.blogspot.com/ by Dr. Strangelove
Reply | Threaded
Open this post in threaded view
|

Re: scraping a web site

glen ep ropella
In reply to this post by Nick Thompson

Hey Nick,

I went ahead and downloaded your page(s) and put it up here:

  http://agent-based-modeling.com/ntnd/nickthompson/naturaldesigns/index.html

Let me know if I've missed anything.  I'm happy to help move it wherever.

-glen

On 01/03/2017 08:49 PM, Nick Thompson wrote:
> I am in the uncomfortable position of being bound by threads of steel to Earthlink.  Many, MANY, years I go I started a website on Earthlink, {http://home.earthlink.net/~nickthompson/naturaldesigns/ <http://home.earthlink.net/%7Enickthompson/naturaldesigns/>
>
> }, and put a lot of my writing, and some commentary up on it.  The website creation and editing medium (trellix) was pretty good for its time, and there are many ways that I find the site quite satisfying.  But gradually Earthlink has withdrawn its support, and now I am not sure I could get in to edit or change it.  Meantime, Research Gate has gotten started, and provides a somewhat better place to meet the world and archive my stuff.  And also, having the site on earthlink binds me to them and their 22 dollar a month fee.  So. …
>
>  
>
> I am wondering if there is a way (or a service that would) scrape the website and, possibly, dump it into a new and more reliable, more website creation medium?  Please, ambulatory knowledge only.  I don’t want a people doing deep searches to answer this  question .


--
glen ep ropella ⊥ 971-280-5699

============================================================
FRIAM Applied Complexity Group listserv
Meets Fridays 9a-11:30 at cafe at St. John's College
to unsubscribe http://redfish.com/mailman/listinfo/friam_redfish.com
FRIAM-COMIC http://friam-comic.blogspot.com/ by Dr. Strangelove
Reply | Threaded
Open this post in threaded view
|

Re: scraping a web site

Owen Densmore
Administrator
webget and sitesucker worked great for me. And once I had to use the internet archive and it was a huge win, whew!

On Wed, Jan 4, 2017 at 10:28 AM, glen ep ropella <[hidden email]> wrote:

Hey Nick,

I went ahead and downloaded your page(s) and put it up here:

  http://agent-based-modeling.com/ntnd/nickthompson/naturaldesigns/index.html

Let me know if I've missed anything.  I'm happy to help move it wherever.

-glen

On 01/03/2017 08:49 PM, Nick Thompson wrote:
> I am in the uncomfortable position of being bound by threads of steel to Earthlink.  Many, MANY, years I go I started a website on Earthlink, {http://home.earthlink.net/~nickthompson/naturaldesigns/ <http://home.earthlink.net/%7Enickthompson/naturaldesigns/>
>
> }, and put a lot of my writing, and some commentary up on it.  The website creation and editing medium (trellix) was pretty good for its time, and there are many ways that I find the site quite satisfying.  But gradually Earthlink has withdrawn its support, and now I am not sure I could get in to edit or change it.  Meantime, Research Gate has gotten started, and provides a somewhat better place to meet the world and archive my stuff.  And also, having the site on earthlink binds me to them and their 22 dollar a month fee.  So. …
>
>
>
> I am wondering if there is a way (or a service that would) scrape the website and, possibly, dump it into a new and more reliable, more website creation medium?  Please, ambulatory knowledge only.  I don’t want a people doing deep searches to answer this  question .


--
glen ep ropella ⊥ <a href="tel:971-280-5699" value="+19712805699">971-280-5699

============================================================
FRIAM Applied Complexity Group listserv
Meets Fridays 9a-11:30 at cafe at St. John's College
to unsubscribe http://redfish.com/mailman/listinfo/friam_redfish.com
FRIAM-COMIC http://friam-comic.blogspot.com/ by Dr. Strangelove


============================================================
FRIAM Applied Complexity Group listserv
Meets Fridays 9a-11:30 at cafe at St. John's College
to unsubscribe http://redfish.com/mailman/listinfo/friam_redfish.com
FRIAM-COMIC http://friam-comic.blogspot.com/ by Dr. Strangelove
Reply | Threaded
Open this post in threaded view
|

Re: scraping a web site

Nick Thompson
In reply to this post by glen ep ropella

My God, Glen,

 

Freedom!!!!!!!!!!!!!!!!

Now I have to think what I want to do next.  In some ideal world, I would sign up for one of those websites where for not too much money you can edit a web site, and bring the old site in and start working on it.  But surely life is not that easy, right?

 

This is amazing.  I never thought I would escape Earthlink.  They have the worst help system … the kind where you have to repeat your question over and over again.  I am breathing deeply. 

 

Nick

 

Nicholas S. Thompson

Emeritus Professor of Psychology and Biology

Clark University

http://home.earthlink.net/~nickthompson/naturaldesigns/

 

 

-----Original Message-----
From: Friam [mailto:[hidden email]] On Behalf Of glen ep ropella
Sent: Wednesday, January 04, 2017 10:29 AM
To: The Friday Morning Applied Complexity Coffee Group <[hidden email]>
Subject: Re: [FRIAM] scraping a web site

 

 

Hey Nick,

 

I went ahead and downloaded your page(s) and put it up here:

 

  http://agent-based-modeling.com/ntnd/nickthompson/naturaldesigns/index.html

 

Let me know if I've missed anything.  I'm happy to help move it wherever.

 

-glen

 

On 01/03/2017 08:49 PM, Nick Thompson wrote:

> I am in the uncomfortable position of being bound by threads of steel

> to Earthlink.  Many, MANY, years I go I started a website on

> Earthlink, {http://home.earthlink.net/~nickthompson/naturaldesigns/

> <http://home.earthlink.net/%7Enickthompson/naturaldesigns/>

>

> }, and put a lot of my writing, and some commentary up on it.  The

> website creation and editing medium (trellix) was pretty good for its

> time, and there are many ways that I find the site quite satisfying. 

> But gradually Earthlink has withdrawn its support, and now I am not

> sure I could get in to edit or change it.  Meantime, Research Gate has

> gotten started, and provides a somewhat better place to meet the world

> and archive my stuff.  And also, having the site on earthlink binds me

> to them and their 22 dollar a month fee.  So. …

>

>

> I am wondering if there is a way (or a service that would) scrape the website and, possibly, dump it into a new and more reliable, more website creation medium?  Please, ambulatory knowledge only.  I don’t want a people doing deep searches to answer this  question .

 

 

--

glen ep ropella 971-280-5699

 

============================================================

FRIAM Applied Complexity Group listserv

Meets Fridays 9a-11:30 at cafe at St. John's College to unsubscribe http://redfish.com/mailman/listinfo/friam_redfish.com

FRIAM-COMIC http://friam-comic.blogspot.com/ by Dr. Strangelove


============================================================
FRIAM Applied Complexity Group listserv
Meets Fridays 9a-11:30 at cafe at St. John's College
to unsubscribe http://redfish.com/mailman/listinfo/friam_redfish.com
FRIAM-COMIC http://friam-comic.blogspot.com/ by Dr. Strangelove
Reply | Threaded
Open this post in threaded view
|

Re: scraping a web site

Gillian Densmore
In reply to this post by glen ep ropella
Nick yes it called giving me a call. I don't know how you set it up.  In short it's a very routine thing to do. Wit
I'd do it for you for free because I consider you a friend.

Thier's always some Oops when you do so. And it depends on how you, fam, and or friends set it up. I

-Wordpress (for example) now has some rocking tools specifically for moving between Weby Web  hosts.

-Basic HTML is pretty portable, But again it depends entirely on how you or a friend set it up.

If you can oldschool FTP to earthlink just download it. From experience it's a good idea to start the new web-host upload it, to make sure you didn't miss something. 

Robert Corginger hit the nail on the head. In simple speak Web-making-tools such as the OG DreamWeaver and Wordpress (as of 4.6) have a variety of tools to move it around. I swear by Database->Files Folder Plugins for wordpress for that reason. 
But needless to say you do have options. 

As to what host? That's a matter of taste and how much you want to play Admin.  Like I said to Steve. I LOVE to have Wordpress on Gilsplace.net just to keep up with trying out setting up  or keeping up a website.
 BUT for just a quick article or rave or geeking out?
Wordpress.com all theway. Wix and Weebly (however that's spelled) are a close tie though.
Because they just work. 
For what it's worth ipage.net AmazonCloud,  get rave reviews. namecheap in my experience isn't to bad. help is a totall crap shoot though.




On Wed, Jan 4, 2017 at 10:28 AM, glen ep ropella <[hidden email]> wrote:

Hey Nick,

I went ahead and downloaded your page(s) and put it up here:

  http://agent-based-modeling.com/ntnd/nickthompson/naturaldesigns/index.html

Let me know if I've missed anything.  I'm happy to help move it wherever.

-glen

On 01/03/2017 08:49 PM, Nick Thompson wrote:
> I am in the uncomfortable position of being bound by threads of steel to Earthlink.  Many, MANY, years I go I started a website on Earthlink, {http://home.earthlink.net/~nickthompson/naturaldesigns/ <http://home.earthlink.net/%7Enickthompson/naturaldesigns/>
>
> }, and put a lot of my writing, and some commentary up on it.  The website creation and editing medium (trellix) was pretty good for its time, and there are many ways that I find the site quite satisfying.  But gradually Earthlink has withdrawn its support, and now I am not sure I could get in to edit or change it.  Meantime, Research Gate has gotten started, and provides a somewhat better place to meet the world and archive my stuff.  And also, having the site on earthlink binds me to them and their 22 dollar a month fee.  So. …
>
>
>
> I am wondering if there is a way (or a service that would) scrape the website and, possibly, dump it into a new and more reliable, more website creation medium?  Please, ambulatory knowledge only.  I don’t want a people doing deep searches to answer this  question .


--
glen ep ropella ⊥ <a href="tel:971-280-5699" value="+19712805699">971-280-5699

============================================================
FRIAM Applied Complexity Group listserv
Meets Fridays 9a-11:30 at cafe at St. John's College
to unsubscribe http://redfish.com/mailman/listinfo/friam_redfish.com
FRIAM-COMIC http://friam-comic.blogspot.com/ by Dr. Strangelove


============================================================
FRIAM Applied Complexity Group listserv
Meets Fridays 9a-11:30 at cafe at St. John's College
to unsubscribe http://redfish.com/mailman/listinfo/friam_redfish.com
FRIAM-COMIC http://friam-comic.blogspot.com/ by Dr. Strangelove
Reply | Threaded
Open this post in threaded view
|

Re: scraping a web site

Gillian Densmore
Nick take a look at these guys: I have had some clients try them, They have zero interestest in learning all the quirks of wordpress and just wanted and I quote "A fucking awsome website that just fucking works!" They loved these:

https://www.weebly.com

www.wix.com/




On Wed, Jan 4, 2017 at 11:10 AM, Gillian Densmore <[hidden email]> wrote:
Nick yes it called giving me a call. I don't know how you set it up.  In short it's a very routine thing to do. Wit
I'd do it for you for free because I consider you a friend.

Thier's always some Oops when you do so. And it depends on how you, fam, and or friends set it up. I

-Wordpress (for example) now has some rocking tools specifically for moving between Weby Web  hosts.

-Basic HTML is pretty portable, But again it depends entirely on how you or a friend set it up.

If you can oldschool FTP to earthlink just download it. From experience it's a good idea to start the new web-host upload it, to make sure you didn't miss something. 

Robert Corginger hit the nail on the head. In simple speak Web-making-tools such as the OG DreamWeaver and Wordpress (as of 4.6) have a variety of tools to move it around. I swear by Database->Files Folder Plugins for wordpress for that reason. 
But needless to say you do have options. 

As to what host? That's a matter of taste and how much you want to play Admin.  Like I said to Steve. I LOVE to have Wordpress on Gilsplace.net just to keep up with trying out setting up  or keeping up a website.
 BUT for just a quick article or rave or geeking out?
Wordpress.com all theway. Wix and Weebly (however that's spelled) are a close tie though.
Because they just work. 
For what it's worth ipage.net AmazonCloud,  get rave reviews. namecheap in my experience isn't to bad. help is a totall crap shoot though.




On Wed, Jan 4, 2017 at 10:28 AM, glen ep ropella <[hidden email]> wrote:

Hey Nick,

I went ahead and downloaded your page(s) and put it up here:

  http://agent-based-modeling.com/ntnd/nickthompson/naturaldesigns/index.html

Let me know if I've missed anything.  I'm happy to help move it wherever.

-glen

On 01/03/2017 08:49 PM, Nick Thompson wrote:
> I am in the uncomfortable position of being bound by threads of steel to Earthlink.  Many, MANY, years I go I started a website on Earthlink, {http://home.earthlink.net/~nickthompson/naturaldesigns/ <http://home.earthlink.net/%7Enickthompson/naturaldesigns/>
>
> }, and put a lot of my writing, and some commentary up on it.  The website creation and editing medium (trellix) was pretty good for its time, and there are many ways that I find the site quite satisfying.  But gradually Earthlink has withdrawn its support, and now I am not sure I could get in to edit or change it.  Meantime, Research Gate has gotten started, and provides a somewhat better place to meet the world and archive my stuff.  And also, having the site on earthlink binds me to them and their 22 dollar a month fee.  So. …
>
>
>
> I am wondering if there is a way (or a service that would) scrape the website and, possibly, dump it into a new and more reliable, more website creation medium?  Please, ambulatory knowledge only.  I don’t want a people doing deep searches to answer this  question .


--
glen ep ropella ⊥ <a href="tel:971-280-5699" value="+19712805699" target="_blank">971-280-5699

============================================================
FRIAM Applied Complexity Group listserv
Meets Fridays 9a-11:30 at cafe at St. John's College
to unsubscribe http://redfish.com/mailman/listinfo/friam_redfish.com
FRIAM-COMIC http://friam-comic.blogspot.com/ by Dr. Strangelove



============================================================
FRIAM Applied Complexity Group listserv
Meets Fridays 9a-11:30 at cafe at St. John's College
to unsubscribe http://redfish.com/mailman/listinfo/friam_redfish.com
FRIAM-COMIC http://friam-comic.blogspot.com/ by Dr. Strangelove
Reply | Threaded
Open this post in threaded view
|

Re: scraping a web site

Nick Thompson
In reply to this post by Gillian Densmore

Hi gil,

 

A very kind offer.  I am a bit overwhelmed by all of this.  Didn’t expect such an array of info so fast, given how slow things have been in the past.

 

I have to read through everything carefully and think what to do next.

 

Take care,

 

Nick

 

Nicholas S. Thompson

Emeritus Professor of Psychology and Biology

Clark University

http://home.earthlink.net/~nickthompson/naturaldesigns/

 

From: Friam [mailto:[hidden email]] On Behalf Of Gillian Densmore
Sent: Wednesday, January 04, 2017 11:11 AM
To: The Friday Morning Applied Complexity Coffee Group <[hidden email]>
Subject: Re: [FRIAM] scraping a web site

 

Nick yes it called giving me a call. I don't know how you set it up.  In short it's a very routine thing to do. Wit

I'd do it for you for free because I consider you a friend.

 

Thier's always some Oops when you do so. And it depends on how you, fam, and or friends set it up. I

 

-Wordpress (for example) now has some rocking tools specifically for moving between Weby Web  hosts.

 

-Basic HTML is pretty portable, But again it depends entirely on how you or a friend set it up.

 

If you can oldschool FTP to earthlink just download it. From experience it's a good idea to start the new web-host upload it, to make sure you didn't miss something. 

 

Robert Corginger hit the nail on the head. In simple speak Web-making-tools such as the OG DreamWeaver and Wordpress (as of 4.6) have a variety of tools to move it around. I swear by Database->Files Folder Plugins for wordpress for that reason. 

But needless to say you do have options. 

 

As to what host? That's a matter of taste and how much you want to play Admin.  Like I said to Steve. I LOVE to have Wordpress on Gilsplace.net just to keep up with trying out setting up  or keeping up a website.

 BUT for just a quick article or rave or geeking out?

Wordpress.com all theway. Wix and Weebly (however that's spelled) are a close tie though.

Because they just work. 

For what it's worth ipage.net AmazonCloud,  get rave reviews. namecheap in my experience isn't to bad. help is a totall crap shoot though.

 

 

 

 

On Wed, Jan 4, 2017 at 10:28 AM, glen ep ropella <[hidden email]> wrote:


Hey Nick,

I went ahead and downloaded your page(s) and put it up here:

  http://agent-based-modeling.com/ntnd/nickthompson/naturaldesigns/index.html

Let me know if I've missed anything.  I'm happy to help move it wherever.

-glen

On 01/03/2017 08:49 PM, Nick Thompson wrote:
> I am in the uncomfortable position of being bound by threads of steel to Earthlink.  Many, MANY, years I go I started a website on Earthlink, {http://home.earthlink.net/~nickthompson/naturaldesigns/ <http://home.earthlink.net/%7Enickthompson/naturaldesigns/>
>
> }, and put a lot of my writing, and some commentary up on it.  The website creation and editing medium (trellix) was pretty good for its time, and there are many ways that I find the site quite satisfying.  But gradually Earthlink has withdrawn its support, and now I am not sure I could get in to edit or change it.  Meantime, Research Gate has gotten started, and provides a somewhat better place to meet the world and archive my stuff.  And also, having the site on earthlink binds me to them and their 22 dollar a month fee.  So. …
>
>
>
> I am wondering if there is a way (or a service that would) scrape the website and, possibly, dump it into a new and more reliable, more website creation medium?  Please, ambulatory knowledge only.  I don’t want a people doing deep searches to answer this  question .


--
glen ep ropella
<a href="tel:971-280-5699">971-280-5699


============================================================
FRIAM Applied Complexity Group listserv
Meets Fridays 9a-11:30 at cafe at St. John's College
to unsubscribe http://redfish.com/mailman/listinfo/friam_redfish.com
FRIAM-COMIC http://friam-comic.blogspot.com/ by Dr. Strangelove

 


============================================================
FRIAM Applied Complexity Group listserv
Meets Fridays 9a-11:30 at cafe at St. John's College
to unsubscribe http://redfish.com/mailman/listinfo/friam_redfish.com
FRIAM-COMIC http://friam-comic.blogspot.com/ by Dr. Strangelove
Reply | Threaded
Open this post in threaded view
|

Re: scraping a web site

Grant Holland
In reply to this post by Nick Thompson

Cool move, Glen.


On 1/4/17 11:05 AM, Nick Thompson wrote:

My God, Glen,

 

Freedom!!!!!!!!!!!!!!!!

Now I have to think what I want to do next.  In some ideal world, I would sign up for one of those websites where for not too much money you can edit a web site, and bring the old site in and start working on it.  But surely life is not that easy, right?

 

This is amazing.  I never thought I would escape Earthlink.  They have the worst help system … the kind where you have to repeat your question over and over again.  I am breathing deeply. 

 

Nick

 

Nicholas S. Thompson

Emeritus Professor of Psychology and Biology

Clark University

http://home.earthlink.net/~nickthompson/naturaldesigns/

 

 

-----Original Message-----
From: Friam [[hidden email]] On Behalf Of glen ep ropella
Sent: Wednesday, January 04, 2017 10:29 AM
To: The Friday Morning Applied Complexity Coffee Group [hidden email]
Subject: Re: [FRIAM] scraping a web site

 

 

Hey Nick,

 

I went ahead and downloaded your page(s) and put it up here:

 

  http://agent-based-modeling.com/ntnd/nickthompson/naturaldesigns/index.html

 

Let me know if I've missed anything.  I'm happy to help move it wherever.

 

-glen

 

On 01/03/2017 08:49 PM, Nick Thompson wrote:

> I am in the uncomfortable position of being bound by threads of steel

> to Earthlink.  Many, MANY, years I go I started a website on

> Earthlink, {http://home.earthlink.net/~nickthompson/naturaldesigns/

> <http://home.earthlink.net/%7Enickthompson/naturaldesigns/>

>

> }, and put a lot of my writing, and some commentary up on it.  The

> website creation and editing medium (trellix) was pretty good for its

> time, and there are many ways that I find the site quite satisfying. 

> But gradually Earthlink has withdrawn its support, and now I am not

> sure I could get in to edit or change it.  Meantime, Research Gate has

> gotten started, and provides a somewhat better place to meet the world

> and archive my stuff.  And also, having the site on earthlink binds me

> to them and their 22 dollar a month fee.  So. …

>

>

> I am wondering if there is a way (or a service that would) scrape the website and, possibly, dump it into a new and more reliable, more website creation medium?  Please, ambulatory knowledge only.  I don’t want a people doing deep searches to answer this  question .

 

 

--

glen ep ropella 971-280-5699

 

============================================================

FRIAM Applied Complexity Group listserv

Meets Fridays 9a-11:30 at cafe at St. John's College to unsubscribe http://redfish.com/mailman/listinfo/friam_redfish.com

FRIAM-COMIC http://friam-comic.blogspot.com/ by Dr. Strangelove



============================================================
FRIAM Applied Complexity Group listserv
Meets Fridays 9a-11:30 at cafe at St. John's College
to unsubscribe http://redfish.com/mailman/listinfo/friam_redfish.com
FRIAM-COMIC http://friam-comic.blogspot.com/ by Dr. Strangelove


============================================================
FRIAM Applied Complexity Group listserv
Meets Fridays 9a-11:30 at cafe at St. John's College
to unsubscribe http://redfish.com/mailman/listinfo/friam_redfish.com
FRIAM-COMIC http://friam-comic.blogspot.com/ by Dr. Strangelove
Reply | Threaded
Open this post in threaded view
|

Re: scraping a web site

Barry MacKichan
In reply to this post by Nick Thompson

Squarespace (https://www.squarespace.com) has a good reputation.
If your site doesn’t require any code on the back end (and I would guess it doesn’t, given its age) you could put it on Amazon S3, so you pay only for the storage space (a few cents per gigabyte).

--Barry


On 3 Jan 2017, at 21:49, Nick Thompson wrote:

Dear Phellow Phriammers,

 

I am in the uncomfortable position of being bound by threads of steel to Earthlink.  Many, MANY, years I go I started a website on Earthlink, {http://home.earthlink.net/~nickthompson/naturaldesigns/

}, and put a lot of my writing, and some commentary up on it.  The website creation and editing medium (trellix) was pretty good for its time, and there are many ways that I find the site quite satisfying.  But gradually Earthlink has withdrawn its support, and now I am not sure I could get in to edit or change it.  Meantime, Research Gate has gotten started, and provides a somewhat better place to meet the world and archive my stuff.  And also, having the site on earthlink binds me to them and their 22 dollar a month fee.  So. …

 

I am wondering if there is a way (or a service that would) scrape the website and, possibly, dump it into a new and more reliable, more website creation medium?  Please, ambulatory knowledge only.  I don’t want a people doing deep searches to answer this  question . 

 

Thanks, as always .

 

Nick

 

Nicholas S. Thompson

Emeritus Professor of Psychology and Biology

Clark University

http://home.earthlink.net/~nickthompson/naturaldesigns/

 

============================================================
FRIAM Applied Complexity Group listserv
Meets Fridays 9a-11:30 at cafe at St. John's College
to unsubscribe http://redfish.com/mailman/listinfo/friam_redfish.com
FRIAM-COMIC http://friam-comic.blogspot.com/ by Dr. Strangelove


============================================================
FRIAM Applied Complexity Group listserv
Meets Fridays 9a-11:30 at cafe at St. John's College
to unsubscribe http://redfish.com/mailman/listinfo/friam_redfish.com
FRIAM-COMIC http://friam-comic.blogspot.com/ by Dr. Strangelove
Reply | Threaded
Open this post in threaded view
|

Re: scraping a web site

Owen Densmore
Administrator
In reply to this post by Gillian Densmore
Hi Nick, glad you recovered your site.

In terms of going forward, there are lots of changes in the blog-o-sphere. Two things to consider:

1 - Content: I.e. what do you want your site to be? Is it mainly "static pages" or does it need bells and whistles like wordpress or other CMS's (Content Management Systems) supply. You might consider Markdown rather than an HTML editor. It is a simple text format that uses a specific format to build pages.

2 - Deployment: I.e. how do you get it into the sky! Hosting services are a dime a dozen and many of them are OK. They are fairly easy to use via a "dashboard". There are several free stunts too, like github pages, zeit now, medium.com and so on.

The latter free sites generally solve both issues but then you have to play their game.

On Wed, Jan 4, 2017 at 11:10 AM, Gillian Densmore <[hidden email]> wrote:
Nick yes it called giving me a call. I don't know how you set it up.  In short it's a very routine thing to do. Wit
I'd do it for you for free because I consider you a friend.

Thier's always some Oops when you do so. And it depends on how you, fam, and or friends set it up. I

-Wordpress (for example) now has some rocking tools specifically for moving between Weby Web  hosts.

-Basic HTML is pretty portable, But again it depends entirely on how you or a friend set it up.

If you can oldschool FTP to earthlink just download it. From experience it's a good idea to start the new web-host upload it, to make sure you didn't miss something. 

Robert Corginger hit the nail on the head. In simple speak Web-making-tools such as the OG DreamWeaver and Wordpress (as of 4.6) have a variety of tools to move it around. I swear by Database->Files Folder Plugins for wordpress for that reason. 
But needless to say you do have options. 

As to what host? That's a matter of taste and how much you want to play Admin.  Like I said to Steve. I LOVE to have Wordpress on Gilsplace.net just to keep up with trying out setting up  or keeping up a website.
 BUT for just a quick article or rave or geeking out?
Wordpress.com all theway. Wix and Weebly (however that's spelled) are a close tie though.
Because they just work. 
For what it's worth ipage.net AmazonCloud,  get rave reviews. namecheap in my experience isn't to bad. help is a totall crap shoot though.




On Wed, Jan 4, 2017 at 10:28 AM, glen ep ropella <[hidden email]> wrote:

Hey Nick,

I went ahead and downloaded your page(s) and put it up here:

  http://agent-based-modeling.com/ntnd/nickthompson/naturaldesigns/index.html

Let me know if I've missed anything.  I'm happy to help move it wherever.

-glen

On 01/03/2017 08:49 PM, Nick Thompson wrote:
> I am in the uncomfortable position of being bound by threads of steel to Earthlink.  Many, MANY, years I go I started a website on Earthlink, {http://home.earthlink.net/~nickthompson/naturaldesigns/ <http://home.earthlink.net/%7Enickthompson/naturaldesigns/>
>
> }, and put a lot of my writing, and some commentary up on it.  The website creation and editing medium (trellix) was pretty good for its time, and there are many ways that I find the site quite satisfying.  But gradually Earthlink has withdrawn its support, and now I am not sure I could get in to edit or change it.  Meantime, Research Gate has gotten started, and provides a somewhat better place to meet the world and archive my stuff.  And also, having the site on earthlink binds me to them and their 22 dollar a month fee.  So. …
>
>
>
> I am wondering if there is a way (or a service that would) scrape the website and, possibly, dump it into a new and more reliable, more website creation medium?  Please, ambulatory knowledge only.  I don’t want a people doing deep searches to answer this  question .


--
glen ep ropella ⊥ <a href="tel:971-280-5699" value="+19712805699" target="_blank">971-280-5699

============================================================
FRIAM Applied Complexity Group listserv
Meets Fridays 9a-11:30 at cafe at St. John's College
to unsubscribe http://redfish.com/mailman/listinfo/friam_redfish.com
FRIAM-COMIC http://friam-comic.blogspot.com/ by Dr. Strangelove


============================================================
FRIAM Applied Complexity Group listserv
Meets Fridays 9a-11:30 at cafe at St. John's College
to unsubscribe http://redfish.com/mailman/listinfo/friam_redfish.com
FRIAM-COMIC http://friam-comic.blogspot.com/ by Dr. Strangelove


============================================================
FRIAM Applied Complexity Group listserv
Meets Fridays 9a-11:30 at cafe at St. John's College
to unsubscribe http://redfish.com/mailman/listinfo/friam_redfish.com
FRIAM-COMIC http://friam-comic.blogspot.com/ by Dr. Strangelove
Reply | Threaded
Open this post in threaded view
|

Re: scraping a web site

Robert J. Cordingley
In reply to this post by glen ep ropella
Hi Glen

So I'm curious, what tools did you use to do Nick's migration?

Thanks, Robert


On 1/4/17 10:28 AM, glen ep ropella wrote:

> Hey Nick,
>
> I went ahead and downloaded your page(s) and put it up here:
>
>    http://agent-based-modeling.com/ntnd/nickthompson/naturaldesigns/index.html
>
> Let me know if I've missed anything.  I'm happy to help move it wherever.
>
> -glen
>
> On 01/03/2017 08:49 PM, Nick Thompson wrote:
>> I am in the uncomfortable position of being bound by threads of steel to Earthlink.  Many, MANY, years I go I started a website on Earthlink, {http://home.earthlink.net/~nickthompson/naturaldesigns/ <http://home.earthlink.net/%7Enickthompson/naturaldesigns/>
>>
>> }, and put a lot of my writing, and some commentary up on it.  The website creation and editing medium (trellix) was pretty good for its time, and there are many ways that I find the site quite satisfying.  But gradually Earthlink has withdrawn its support, and now I am not sure I could get in to edit or change it.  Meantime, Research Gate has gotten started, and provides a somewhat better place to meet the world and archive my stuff.  And also, having the site on earthlink binds me to them and their 22 dollar a month fee.  So. …
>>
>>  
>>
>> I am wondering if there is a way (or a service that would) scrape the website and, possibly, dump it into a new and more reliable, more website creation medium?  Please, ambulatory knowledge only.  I don’t want a people doing deep searches to answer this  question .
>

--
Cirrillian
Web Design & Development
Santa Fe, NM
http://cirrillian.com
281-989-6272 (cell)
Member Design Corps of Santa Fe


============================================================
FRIAM Applied Complexity Group listserv
Meets Fridays 9a-11:30 at cafe at St. John's College
to unsubscribe http://redfish.com/mailman/listinfo/friam_redfish.com
FRIAM-COMIC http://friam-comic.blogspot.com/ by Dr. Strangelove
Reply | Threaded
Open this post in threaded view
|

Re: scraping a web site

Robert Wall
In reply to this post by Owen Densmore
Nick,

A lot of good ideas here.  I will just add one more.  For what you are doing, from what I can tell, you may do well with a so-called non-database Content Management System like Kirby.  There are scores of these out there now, many being free.  I have some experience with Kirby.  It was just $40 ($17 now I think) and very easy to use and very good support.  It is file-folder based, very intuitive and has been around for a while. Essentially, being file-based, if you can build a Windows directory, you can build a decent website.  They have some examples of some very minimalist ways to go.

As Owen say, you do need to figure out how you are going to get it into the sky, so to speak. I have been using HostGtor for years, but there are others.

Glen,

A good steer to the File Manager plugins to get to the backend from the Admin Panel in WordPress.   I installed WP File Manager (free), but the Pro version looks like what one really needs if, say, you want to do some code editing. This is useful.

On Wed, Jan 4, 2017 at 1:15 PM, Owen Densmore <[hidden email]> wrote:
Hi Nick, glad you recovered your site.

In terms of going forward, there are lots of changes in the blog-o-sphere. Two things to consider:

1 - Content: I.e. what do you want your site to be? Is it mainly "static pages" or does it need bells and whistles like wordpress or other CMS's (Content Management Systems) supply. You might consider Markdown rather than an HTML editor. It is a simple text format that uses a specific format to build pages.

2 - Deployment: I.e. how do you get it into the sky! Hosting services are a dime a dozen and many of them are OK. They are fairly easy to use via a "dashboard". There are several free stunts too, like github pages, zeit now, medium.com and so on.

The latter free sites generally solve both issues but then you have to play their game.

On Wed, Jan 4, 2017 at 11:10 AM, Gillian Densmore <[hidden email]> wrote:
Nick yes it called giving me a call. I don't know how you set it up.  In short it's a very routine thing to do. Wit
I'd do it for you for free because I consider you a friend.

Thier's always some Oops when you do so. And it depends on how you, fam, and or friends set it up. I

-Wordpress (for example) now has some rocking tools specifically for moving between Weby Web  hosts.

-Basic HTML is pretty portable, But again it depends entirely on how you or a friend set it up.

If you can oldschool FTP to earthlink just download it. From experience it's a good idea to start the new web-host upload it, to make sure you didn't miss something. 

Robert Corginger hit the nail on the head. In simple speak Web-making-tools such as the OG DreamWeaver and Wordpress (as of 4.6) have a variety of tools to move it around. I swear by Database->Files Folder Plugins for wordpress for that reason. 
But needless to say you do have options. 

As to what host? That's a matter of taste and how much you want to play Admin.  Like I said to Steve. I LOVE to have Wordpress on Gilsplace.net just to keep up with trying out setting up  or keeping up a website.
 BUT for just a quick article or rave or geeking out?
Wordpress.com all theway. Wix and Weebly (however that's spelled) are a close tie though.
Because they just work. 
For what it's worth ipage.net AmazonCloud,  get rave reviews. namecheap in my experience isn't to bad. help is a totall crap shoot though.




On Wed, Jan 4, 2017 at 10:28 AM, glen ep ropella <[hidden email]> wrote:

Hey Nick,

I went ahead and downloaded your page(s) and put it up here:

  http://agent-based-modeling.com/ntnd/nickthompson/naturaldesigns/index.html

Let me know if I've missed anything.  I'm happy to help move it wherever.

-glen

On 01/03/2017 08:49 PM, Nick Thompson wrote:
> I am in the uncomfortable position of being bound by threads of steel to Earthlink.  Many, MANY, years I go I started a website on Earthlink, {http://home.earthlink.net/~nickthompson/naturaldesigns/ <http://home.earthlink.net/%7Enickthompson/naturaldesigns/>
>
> }, and put a lot of my writing, and some commentary up on it.  The website creation and editing medium (trellix) was pretty good for its time, and there are many ways that I find the site quite satisfying.  But gradually Earthlink has withdrawn its support, and now I am not sure I could get in to edit or change it.  Meantime, Research Gate has gotten started, and provides a somewhat better place to meet the world and archive my stuff.  And also, having the site on earthlink binds me to them and their 22 dollar a month fee.  So. …
>
>
>
> I am wondering if there is a way (or a service that would) scrape the website and, possibly, dump it into a new and more reliable, more website creation medium?  Please, ambulatory knowledge only.  I don’t want a people doing deep searches to answer this  question .


--
glen ep ropella ⊥ <a href="tel:971-280-5699" value="+19712805699" target="_blank">971-280-5699

============================================================
FRIAM Applied Complexity Group listserv
Meets Fridays 9a-11:30 at cafe at St. John's College
to unsubscribe http://redfish.com/mailman/listinfo/friam_redfish.com
FRIAM-COMIC http://friam-comic.blogspot.com/ by Dr. Strangelove


============================================================
FRIAM Applied Complexity Group listserv
Meets Fridays 9a-11:30 at cafe at St. John's College
to unsubscribe http://redfish.com/mailman/listinfo/friam_redfish.com
FRIAM-COMIC http://friam-comic.blogspot.com/ by Dr. Strangelove


============================================================
FRIAM Applied Complexity Group listserv
Meets Fridays 9a-11:30 at cafe at St. John's College
to unsubscribe http://redfish.com/mailman/listinfo/friam_redfish.com
FRIAM-COMIC http://friam-comic.blogspot.com/ by Dr. Strangelove


============================================================
FRIAM Applied Complexity Group listserv
Meets Fridays 9a-11:30 at cafe at St. John's College
to unsubscribe http://redfish.com/mailman/listinfo/friam_redfish.com
FRIAM-COMIC http://friam-comic.blogspot.com/ by Dr. Strangelove
Reply | Threaded
Open this post in threaded view
|

Re: scraping a web site

glen ep ropella
In reply to this post by Robert J. Cordingley

$ wget -k -r http://home.earthlink.net/%7Enickthompson/naturaldesigns/

On 01/04/2017 12:16 PM, Robert J. Cordingley wrote:
> So I'm curious, what tools did you use to do Nick's migration?

--
glen ep ropella ⊥ 971-280-5699

============================================================
FRIAM Applied Complexity Group listserv
Meets Fridays 9a-11:30 at cafe at St. John's College
to unsubscribe http://redfish.com/mailman/listinfo/friam_redfish.com
FRIAM-COMIC http://friam-comic.blogspot.com/ by Dr. Strangelove
Reply | Threaded
Open this post in threaded view
|

Re: scraping a web site

Marcus G. Daniels
In reply to this post by Nick Thompson

Glen, is this like a `free’ signup to Hulu, right?   Cancel now, or expect an invoice? 

 

From: Friam [mailto:[hidden email]] On Behalf Of Nick Thompson
Sent: Wednesday, January 04, 2017 11:05 AM
To: 'The Friday Morning Applied Complexity Coffee Group' <[hidden email]>
Subject: Re: [FRIAM] scraping a web site

 

My God, Glen,

 

Freedom!!!!!!!!!!!!!!!!

Now I have to think what I want to do next.  In some ideal world, I would sign up for one of those websites where for not too much money you can edit a web site, and bring the old site in and start working on it.  But surely life is not that easy, right?

 

This is amazing.  I never thought I would escape Earthlink.  They have the worst help system … the kind where you have to repeat your question over and over again.  I am breathing deeply. 

 

Nick

 

Nicholas S. Thompson

Emeritus Professor of Psychology and Biology

Clark University

http://home.earthlink.net/~nickthompson/naturaldesigns/

 

 

-----Original Message-----
From: Friam [[hidden email]] On Behalf Of glen ep ropella
Sent: Wednesday, January 04, 2017 10:29 AM
To: The Friday Morning Applied Complexity Coffee Group <[hidden email]>
Subject: Re: [FRIAM] scraping a web site

 

 

Hey Nick,

 

I went ahead and downloaded your page(s) and put it up here:

 

  http://agent-based-modeling.com/ntnd/nickthompson/naturaldesigns/index.html

 

Let me know if I've missed anything.  I'm happy to help move it wherever.

 

-glen

 

On 01/03/2017 08:49 PM, Nick Thompson wrote:

> I am in the uncomfortable position of being bound by threads of steel

> to Earthlink.  Many, MANY, years I go I started a website on

> Earthlink, {http://home.earthlink.net/~nickthompson/naturaldesigns/

> <http://home.earthlink.net/%7Enickthompson/naturaldesigns/>

>

> }, and put a lot of my writing, and some commentary up on it.  The

> website creation and editing medium (trellix) was pretty good for its

> time, and there are many ways that I find the site quite satisfying. 

> But gradually Earthlink has withdrawn its support, and now I am not

> sure I could get in to edit or change it.  Meantime, Research Gate has

> gotten started, and provides a somewhat better place to meet the world

> and archive my stuff.  And also, having the site on earthlink binds me

> to them and their 22 dollar a month fee.  So. …

>

>

> I am wondering if there is a way (or a service that would) scrape the website and, possibly, dump it into a new and more reliable, more website creation medium?  Please, ambulatory knowledge only.  I don’t want a people doing deep searches to answer this  question .

 

 

--

glen ep ropella 971-280-5699

 

============================================================

FRIAM Applied Complexity Group listserv

Meets Fridays 9a-11:30 at cafe at St. John's College to unsubscribe http://redfish.com/mailman/listinfo/friam_redfish.com

FRIAM-COMIC http://friam-comic.blogspot.com/ by Dr. Strangelove


============================================================
FRIAM Applied Complexity Group listserv
Meets Fridays 9a-11:30 at cafe at St. John's College
to unsubscribe http://redfish.com/mailman/listinfo/friam_redfish.com
FRIAM-COMIC http://friam-comic.blogspot.com/ by Dr. Strangelove
Reply | Threaded
Open this post in threaded view
|

Re: scraping a web site

gepr

Mwahahahahah! [wrings hands]

No.  I just hate the way everyone tries to make money off what should be infrastructure.  Everyone should get their own website.  That they control entirely.  I keep intending to set up a permanent one for myself on IPFS <https://ipfs.io/, https://github.com/ipfs/ipfs>.  But I'm just too lazy.

For the record, I told Nick I'll host it until he figures out what he wants to do long-term.

On 01/04/2017 12:33 PM, Marcus Daniels wrote:
> Glen, is this like a `free’ signup to Hulu, right?   Cancel now, or expect an invoice?

--
☣ glen

============================================================
FRIAM Applied Complexity Group listserv
Meets Fridays 9a-11:30 at cafe at St. John's College
to unsubscribe http://redfish.com/mailman/listinfo/friam_redfish.com
FRIAM-COMIC http://friam-comic.blogspot.com/ by Dr. Strangelove
uǝʃƃ ⊥ glen
12