scraping a web site

classic Classic list List threaded Threaded
28 messages Options
12
Reply | Threaded
Open this post in threaded view
|

Re: scraping a web site

Marcus G. Daniels

Ah, like it's private storage sibling

 

 

-----Original Message-----
From: Friam [mailto:[hidden email]] On Behalf Of glen ?
Sent: Wednesday, January 04, 2017 1:41 PM
To: The Friday Morning Applied Complexity Coffee Group <[hidden email]>
Subject: Re: [FRIAM] scraping a web site

 

 

Mwahahahahah! [wrings hands]

 

No.  I just hate the way everyone tries to make money off what should be infrastructure.  Everyone should get their own website.  That they control entirely.  I keep intending to set up a permanent one for myself on IPFS <https://ipfs.io/, https://github.com/ipfs/ipfs>.  But I'm just too lazy.

 

For the record, I told Nick I'll host it until he figures out what he wants to do long-term.

 

On 01/04/2017 12:33 PM, Marcus Daniels wrote:

> Glen, is this like a `free’ signup to Hulu, right?   Cancel now, or expect an invoice?

 

--

glen

 

============================================================

FRIAM Applied Complexity Group listserv

Meets Fridays 9a-11:30 at cafe at St. John's College to unsubscribe http://redfish.com/mailman/listinfo/friam_redfish.com

FRIAM-COMIC http://friam-comic.blogspot.com/ by Dr. Strangelove


============================================================
FRIAM Applied Complexity Group listserv
Meets Fridays 9a-11:30 at cafe at St. John's College
to unsubscribe http://redfish.com/mailman/listinfo/friam_redfish.com
FRIAM-COMIC http://friam-comic.blogspot.com/ by Dr. Strangelove
Reply | Threaded
Open this post in threaded view
|

Re: scraping a web site

Owen Densmore
Administrator
In reply to this post by gepr
I'd suggest the first step is getting a Domain Name. Then pointing it to earthlink's email but telling everyone the DNS name's pointer. So nicksplace.net say. Then put the existing pages where-every you'd like, again pointing to them with Nicks domain.

Then any time you want to move to a new service for email or blog, it is really easy .. kinda like having a permanent phone number that never changes. 

I've had backspaces.net forever it seems and have moved between providers tens of times and no changes required in terms of addresses.


============================================================
FRIAM Applied Complexity Group listserv
Meets Fridays 9a-11:30 at cafe at St. John's College
to unsubscribe http://redfish.com/mailman/listinfo/friam_redfish.com
FRIAM-COMIC http://friam-comic.blogspot.com/ by Dr. Strangelove
Reply | Threaded
Open this post in threaded view
|

Re: scraping a web site

Nick Thompson
In reply to this post by Barry MacKichan

Thanks, Barry.  Fabulous.  N

 

Nicholas S. Thompson

Emeritus Professor of Psychology and Biology

Clark University

http://home.earthlink.net/~nickthompson/naturaldesigns/

 

From: Friam [mailto:[hidden email]] On Behalf Of Barry MacKichan
Sent: Wednesday, January 04, 2017 12:07 PM
To: The Friday Morning Applied Complexity Coffee Group <[hidden email]>
Subject: Re: [FRIAM] scraping a web site

 

Squarespace (https://www.squarespace.com) has a good reputation.
If your site doesn’t require any code on the back end (and I would guess it doesn’t, given its age) you could put it on Amazon S3, so you pay only for the storage space (a few cents per gigabyte).

--Barry

 

On 3 Jan 2017, at 21:49, Nick Thompson wrote:

Dear Phellow Phriammers,

 

I am in the uncomfortable position of being bound by threads of steel to Earthlink.  Many, MANY, years I go I started a website on Earthlink, {http://home.earthlink.net/~nickthompson/naturaldesigns/

}, and put a lot of my writing, and some commentary up on it.  The website creation and editing medium (trellix) was pretty good for its time, and there are many ways that I find the site quite satisfying.  But gradually Earthlink has withdrawn its support, and now I am not sure I could get in to edit or change it.  Meantime, Research Gate has gotten started, and provides a somewhat better place to meet the world and archive my stuff.  And also, having the site on earthlink binds me to them and their 22 dollar a month fee.  So. …

 

I am wondering if there is a way (or a service that would) scrape the website and, possibly, dump it into a new and more reliable, more website creation medium?  Please, ambulatory knowledge only.  I don’t want a people doing deep searches to answer this  question . 

 

Thanks, as always .

 

Nick

 

Nicholas S. Thompson

Emeritus Professor of Psychology and Biology

Clark University

http://home.earthlink.net/~nickthompson/naturaldesigns/

 

============================================================
FRIAM Applied Complexity Group listserv
Meets Fridays 9a-11:30 at cafe at St. John's College
to unsubscribe http://redfish.com/mailman/listinfo/friam_redfish.com
FRIAM-COMIC http://friam-comic.blogspot.com/ by Dr. Strangelove


============================================================
FRIAM Applied Complexity Group listserv
Meets Fridays 9a-11:30 at cafe at St. John's College
to unsubscribe http://redfish.com/mailman/listinfo/friam_redfish.com
FRIAM-COMIC http://friam-comic.blogspot.com/ by Dr. Strangelove
Reply | Threaded
Open this post in threaded view
|

Re: scraping a web site

Gillian Densmore
Kirby eh? it was kind of quirk when I tried for what's worth. 

On Wed, Jan 4, 2017 at 1:55 PM, Nick Thompson <[hidden email]> wrote:

Thanks, Barry.  Fabulous.  N

 

Nicholas S. Thompson

Emeritus Professor of Psychology and Biology

Clark University

http://home.earthlink.net/~nickthompson/naturaldesigns/

 

From: Friam [mailto:[hidden email]] On Behalf Of Barry MacKichan
Sent: Wednesday, January 04, 2017 12:07 PM
To: The Friday Morning Applied Complexity Coffee Group <[hidden email]>
Subject: Re: [FRIAM] scraping a web site

 

Squarespace (https://www.squarespace.com) has a good reputation.


If your site doesn’t require any code on the back end (and I would guess it doesn’t, given its age) you could put it on Amazon S3, so you pay only for the storage space (a few cents per gigabyte).

--Barry

 

On 3 Jan 2017, at 21:49, Nick Thompson wrote:

Dear Phellow Phriammers,

 

I am in the uncomfortable position of being bound by threads of steel to Earthlink.  Many, MANY, years I go I started a website on Earthlink, {http://home.earthlink.net/~nickthompson/naturaldesigns/

}, and put a lot of my writing, and some commentary up on it.  The website creation and editing medium (trellix) was pretty good for its time, and there are many ways that I find the site quite satisfying.  But gradually Earthlink has withdrawn its support, and now I am not sure I could get in to edit or change it.  Meantime, Research Gate has gotten started, and provides a somewhat better place to meet the world and archive my stuff.  And also, having the site on earthlink binds me to them and their 22 dollar a month fee.  So. …

 

I am wondering if there is a way (or a service that would) scrape the website and, possibly, dump it into a new and more reliable, more website creation medium?  Please, ambulatory knowledge only.  I don’t want a people doing deep searches to answer this  question . 

 

Thanks, as always .

 

Nick

 

Nicholas S. Thompson

Emeritus Professor of Psychology and Biology

Clark University

http://home.earthlink.net/~nickthompson/naturaldesigns/

 

============================================================
FRIAM Applied Complexity Group listserv
Meets Fridays 9a-11:30 at cafe at St. John's College
to unsubscribe http://redfish.com/mailman/listinfo/friam_redfish.com
FRIAM-COMIC http://friam-comic.blogspot.com/ by Dr. Strangelove


============================================================
FRIAM Applied Complexity Group listserv
Meets Fridays 9a-11:30 at cafe at St. John's College
to unsubscribe http://redfish.com/mailman/listinfo/friam_redfish.com
FRIAM-COMIC http://friam-comic.blogspot.com/ by Dr. Strangelove


============================================================
FRIAM Applied Complexity Group listserv
Meets Fridays 9a-11:30 at cafe at St. John's College
to unsubscribe http://redfish.com/mailman/listinfo/friam_redfish.com
FRIAM-COMIC http://friam-comic.blogspot.com/ by Dr. Strangelove
Reply | Threaded
Open this post in threaded view
|

Re: scraping a web site

Gary Schiltz-4
As if Nick's head isn't already spinning from all the advice, one thing I would mention is just how big a share of web access these days is via mobile devices, i.e. smartphones and tablets. According to some (reputable?) sources, glean from a single Google search, mobile use has overtaken desktop use (desktop includes laptops). If true, then is a good idea to design new web sites with this in mind, and use a platform that supports "responsive" design, where the platform detects the capabilities of the user's browser and formats pages accordingly. I haven't dug into Nick's site yet, so I don't how amenable the pages are to formatting for a four inch screen :-)

On Wed, Jan 4, 2017 at 5:03 PM, Gillian Densmore <[hidden email]> wrote:
Kirby eh? it was kind of quirk when I tried for what's worth. 

On Wed, Jan 4, 2017 at 1:55 PM, Nick Thompson <[hidden email]> wrote:

Thanks, Barry.  Fabulous.  N

 

Nicholas S. Thompson

Emeritus Professor of Psychology and Biology

Clark University

http://home.earthlink.net/~nickthompson/naturaldesigns/

 

From: Friam [mailto:[hidden email]] On Behalf Of Barry MacKichan
Sent: Wednesday, January 04, 2017 12:07 PM
To: The Friday Morning Applied Complexity Coffee Group <[hidden email]>
Subject: Re: [FRIAM] scraping a web site

 

Squarespace (https://www.squarespace.com) has a good reputation.


If your site doesn’t require any code on the back end (and I would guess it doesn’t, given its age) you could put it on Amazon S3, so you pay only for the storage space (a few cents per gigabyte).

--Barry

 

On 3 Jan 2017, at 21:49, Nick Thompson wrote:

Dear Phellow Phriammers,

 

I am in the uncomfortable position of being bound by threads of steel to Earthlink.  Many, MANY, years I go I started a website on Earthlink, {http://home.earthlink.net/~nickthompson/naturaldesigns/

}, and put a lot of my writing, and some commentary up on it.  The website creation and editing medium (trellix) was pretty good for its time, and there are many ways that I find the site quite satisfying.  But gradually Earthlink has withdrawn its support, and now I am not sure I could get in to edit or change it.  Meantime, Research Gate has gotten started, and provides a somewhat better place to meet the world and archive my stuff.  And also, having the site on earthlink binds me to them and their 22 dollar a month fee.  So. …

 

I am wondering if there is a way (or a service that would) scrape the website and, possibly, dump it into a new and more reliable, more website creation medium?  Please, ambulatory knowledge only.  I don’t want a people doing deep searches to answer this  question . 

 

Thanks, as always .

 

Nick

 

Nicholas S. Thompson

Emeritus Professor of Psychology and Biology

Clark University

http://home.earthlink.net/~nickthompson/naturaldesigns/

 

============================================================
FRIAM Applied Complexity Group listserv
Meets Fridays 9a-11:30 at cafe at St. John's College
to unsubscribe http://redfish.com/mailman/listinfo/friam_redfish.com
FRIAM-COMIC http://friam-comic.blogspot.com/ by Dr. Strangelove


============================================================
FRIAM Applied Complexity Group listserv
Meets Fridays 9a-11:30 at cafe at St. John's College
to unsubscribe http://redfish.com/mailman/listinfo/friam_redfish.com
FRIAM-COMIC http://friam-comic.blogspot.com/ by Dr. Strangelove


============================================================
FRIAM Applied Complexity Group listserv
Meets Fridays 9a-11:30 at cafe at St. John's College
to unsubscribe http://redfish.com/mailman/listinfo/friam_redfish.com
FRIAM-COMIC http://friam-comic.blogspot.com/ by Dr. Strangelove


============================================================
FRIAM Applied Complexity Group listserv
Meets Fridays 9a-11:30 at cafe at St. John's College
to unsubscribe http://redfish.com/mailman/listinfo/friam_redfish.com
FRIAM-COMIC http://friam-comic.blogspot.com/ by Dr. Strangelove
Reply | Threaded
Open this post in threaded view
|

Re: scraping a web site

Marcus G. Daniels

Yes, I’d usually use my phone for reading if it weren’t for the paywall boundaries.  If someone insists on forcing a PDF on me, fine, but it is so much faster to find things with HTML than tortured LaTeX output.  And minimize the damned JavaScript gumming everything up.  Simple declarative content.   Actual printouts are just a way to get more screen real estate than I have when I’m trying to learn something new. 

 

From: Friam [mailto:[hidden email]] On Behalf Of Gary Schiltz
Sent: Wednesday, January 04, 2017 3:34 PM
To: The Friday Morning Applied Complexity Coffee Group <[hidden email]>
Subject: Re: [FRIAM] scraping a web site

 

As if Nick's head isn't already spinning from all the advice, one thing I would mention is just how big a share of web access these days is via mobile devices, i.e. smartphones and tablets. According to some (reputable?) sources, glean from a single Google search, mobile use has overtaken desktop use (desktop includes laptops). If true, then is a good idea to design new web sites with this in mind, and use a platform that supports "responsive" design, where the platform detects the capabilities of the user's browser and formats pages accordingly. I haven't dug into Nick's site yet, so I don't how amenable the pages are to formatting for a four inch screen :-)

 

On Wed, Jan 4, 2017 at 5:03 PM, Gillian Densmore <[hidden email]> wrote:

Kirby eh? it was kind of quirk when I tried for what's worth. 

 

On Wed, Jan 4, 2017 at 1:55 PM, Nick Thompson <[hidden email]> wrote:

Thanks, Barry.  Fabulous.  N

 

Nicholas S. Thompson

Emeritus Professor of Psychology and Biology

Clark University

http://home.earthlink.net/~nickthompson/naturaldesigns/

 

From: Friam [mailto:[hidden email]] On Behalf Of Barry MacKichan
Sent: Wednesday, January 04, 2017 12:07 PM
To: The Friday Morning Applied Complexity Coffee Group <[hidden email]>
Subject: Re: [FRIAM] scraping a web site

 

Squarespace (https://www.squarespace.com) has a good reputation.


If your site doesn’t require any code on the back end (and I would guess it doesn’t, given its age) you could put it on Amazon S3, so you pay only for the storage space (a few cents per gigabyte).

--Barry

 

On 3 Jan 2017, at 21:49, Nick Thompson wrote:

Dear Phellow Phriammers,

 

I am in the uncomfortable position of being bound by threads of steel to Earthlink.  Many, MANY, years I go I started a website on Earthlink, {http://home.earthlink.net/~nickthompson/naturaldesigns/

}, and put a lot of my writing, and some commentary up on it.  The website creation and editing medium (trellix) was pretty good for its time, and there are many ways that I find the site quite satisfying.  But gradually Earthlink has withdrawn its support, and now I am not sure I could get in to edit or change it.  Meantime, Research Gate has gotten started, and provides a somewhat better place to meet the world and archive my stuff.  And also, having the site on earthlink binds me to them and their 22 dollar a month fee.  So. …

 

I am wondering if there is a way (or a service that would) scrape the website and, possibly, dump it into a new and more reliable, more website creation medium?  Please, ambulatory knowledge only.  I don’t want a people doing deep searches to answer this  question . 

 

Thanks, as always .

 

Nick

 

Nicholas S. Thompson

Emeritus Professor of Psychology and Biology

Clark University

http://home.earthlink.net/~nickthompson/naturaldesigns/

 

============================================================
FRIAM Applied Complexity Group listserv
Meets Fridays 9a-11:30 at cafe at St. John's College
to unsubscribe http://redfish.com/mailman/listinfo/friam_redfish.com
FRIAM-COMIC http://friam-comic.blogspot.com/ by Dr. Strangelove


============================================================
FRIAM Applied Complexity Group listserv
Meets Fridays 9a-11:30 at cafe at St. John's College
to unsubscribe http://redfish.com/mailman/listinfo/friam_redfish.com
FRIAM-COMIC http://friam-comic.blogspot.com/ by Dr. Strangelove

 


============================================================
FRIAM Applied Complexity Group listserv
Meets Fridays 9a-11:30 at cafe at St. John's College
to unsubscribe http://redfish.com/mailman/listinfo/friam_redfish.com
FRIAM-COMIC http://friam-comic.blogspot.com/ by Dr. Strangelove

 


============================================================
FRIAM Applied Complexity Group listserv
Meets Fridays 9a-11:30 at cafe at St. John's College
to unsubscribe http://redfish.com/mailman/listinfo/friam_redfish.com
FRIAM-COMIC http://friam-comic.blogspot.com/ by Dr. Strangelove
Reply | Threaded
Open this post in threaded view
|

Re: scraping a web site

glen ep ropella
In reply to this post by Gary Schiltz-4

Great idea!  I just added:

<meta name="viewport" content="width=device-width, initial-scale=1.0, maximum-scale=1.0, user-scalable=no, minimal-ui">

to all Nick's pages on my host.  It helped a lot on my phone.

On 01/04/2017 02:33 PM, Gary Schiltz wrote:
> As if Nick's head isn't already spinning from all the advice, one thing I would mention is just how big a share of web access these days is via mobile devices, i.e. smartphones and tablets. According to some (reputable?) sources, glean from a single Google search, mobile use has overtaken desktop use (desktop includes laptops). If true, then is a good idea to design new web sites with this in mind, and use a platform that supports "responsive" design, where the platform detects the capabilities of the user's browser and formats pages accordingly. I haven't dug into Nick's site yet, so I don't how amenable the pages are to formatting for a four inch screen :-)

--
glen ep ropella ⊥ 971-280-5699

============================================================
FRIAM Applied Complexity Group listserv
Meets Fridays 9a-11:30 at cafe at St. John's College
to unsubscribe http://redfish.com/mailman/listinfo/friam_redfish.com
FRIAM-COMIC http://friam-comic.blogspot.com/ by Dr. Strangelove
Reply | Threaded
Open this post in threaded view
|

Re: scraping a web site

Steve Smith
In reply to this post by Marcus G. Daniels

I thought this was a reference to the way siblings (and grown children) ask to store something in your garage or workshop "just for a weekend" and 20 years later you have to throw it out or give it to goodwill because they never picked it up, even through several moves!


On 1/4/17 1:49 PM, Marcus Daniels wrote:

Ah, like it's private storage sibling

 

 

-----Original Message-----
From: Friam [[hidden email]] On Behalf Of glen ?
Sent: Wednesday, January 04, 2017 1:41 PM
To: The Friday Morning Applied Complexity Coffee Group [hidden email]
Subject: Re: [FRIAM] scraping a web site

 

 

Mwahahahahah! [wrings hands]

 

No.  I just hate the way everyone tries to make money off what should be infrastructure.  Everyone should get their own website.  That they control entirely.  I keep intending to set up a permanent one for myself on IPFS <https://ipfs.io/, https://github.com/ipfs/ipfs>.  But I'm just too lazy.

 

For the record, I told Nick I'll host it until he figures out what he wants to do long-term.

 

On 01/04/2017 12:33 PM, Marcus Daniels wrote:

> Glen, is this like a `free’ signup to Hulu, right?   Cancel now, or expect an invoice?

 

--

glen

 

============================================================

FRIAM Applied Complexity Group listserv

Meets Fridays 9a-11:30 at cafe at St. John's College to unsubscribe http://redfish.com/mailman/listinfo/friam_redfish.com

FRIAM-COMIC http://friam-comic.blogspot.com/ by Dr. Strangelove



============================================================
FRIAM Applied Complexity Group listserv
Meets Fridays 9a-11:30 at cafe at St. John's College
to unsubscribe http://redfish.com/mailman/listinfo/friam_redfish.com
FRIAM-COMIC http://friam-comic.blogspot.com/ by Dr. Strangelove


============================================================
FRIAM Applied Complexity Group listserv
Meets Fridays 9a-11:30 at cafe at St. John's College
to unsubscribe http://redfish.com/mailman/listinfo/friam_redfish.com
FRIAM-COMIC http://friam-comic.blogspot.com/ by Dr. Strangelove
12