I would like to have the data from this website, http://prosurf.sbb.ch/pros/inter/mainsite_e.html, as XML. The website displays the exact arrival and departure times of trains for almost every Swiss train station.
The website lists the train stations alphabetically: http://prosurf.sbb.ch/pros/inter/prosurfservlet?TRANSACTION=093&ENTRYPAGE=A shows all train stations beginning with A, http://prosurf.sbb.ch/pros/inter/prosurfservlet?TRANSACTION=093&ENTRYPAGE=B shows those beginning with B, and so on.
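To illustrate the alphabetical index, here is a minimal Python sketch that generates the 26 index URLs. The function name `index_urls` is hypothetical, and it assumes the `ENTRYPAGE` parameter accepts every letter from A to Z in the same way as the two examples above.

```python
import string

# Servlet address taken from the example URLs above.
BASE = "http://prosurf.sbb.ch/pros/inter/prosurfservlet"


def index_urls():
    """Return one station-index URL per letter, A through Z (assumed range)."""
    return [
        f"{BASE}?TRANSACTION=093&ENTRYPAGE={letter}"
        for letter in string.ascii_uppercase
    ]


if __name__ == "__main__":
    for url in index_urls():
        print(url)
```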
For each individual train station (e.g. Lausanne, http://prosurf.sbb.ch/pros/inter/prosurfservlet?TRANSACTION=004&LANGUAGE=e&PBP=LS&DIRECTION=2), one page shows the arrival times (http://prosurf.sbb.ch/pros/inter/prosurfservlet?transaction=004&language=e&pbp=LS&direction=1) and a second page shows the departure times (http://prosurf.sbb.ch/pros/inter/prosurfservlet?transaction=004&language=e&pbp=LS&direction=2).
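The per-station URLs follow a regular pattern, so they could be built as in the following sketch. The helper name `station_url` and the constants `ARRIVALS` and `DEPARTURES` are hypothetical; the parameter names and values are copied from the Lausanne example, and it is an assumption that other stations only differ in the `pbp` code.

```python
from urllib.parse import urlencode

# Servlet address taken from the example URLs above.
BASE = "http://prosurf.sbb.ch/pros/inter/prosurfservlet"

# Direction values as used in the Lausanne example.
ARRIVALS = 1
DEPARTURES = 2


def station_url(station_code, direction, language="e"):
    """Build the arrivals or departures URL for one station code (e.g. "LS")."""
    query = urlencode({
        "transaction": "004",
        "language": language,
        "pbp": station_code,
        "direction": direction,
    })
    return f"{BASE}?{query}"


if __name__ == "__main__":
    print(station_url("LS", ARRIVALS))
    print(station_url("LS", DEPARTURES))
```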
On each page, the individual trains are linked by train number. Clicking on a train number shows the full details of that train line, including (in the field "current") the expected delays.
As the end result, I would like one large XML file OR 26 XML files (one for each letter of the alphabet) containing:
the train stations
per station, the train numbers
per train number, the data: type of train, start station, final destination, expected arrival and departure times, actual arrival and departure times, departure delay and arrival delay.
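One possible shape for the XML output is sketched below in Python. All element and attribute names (`stations`, `station`, `train`, `expected_arrival`, and so on) are assumptions chosen for illustration, not a fixed schema, and the values in `SAMPLE` are made-up placeholders rather than real timetable data.

```python
import xml.etree.ElementTree as ET

# Placeholder input only: every value here is invented for illustration.
SAMPLE = [{
    "name": "Lausanne",
    "trains": [{
        "number": "0000",
        "data": {
            "type": "IC",
            "start_station": "A",
            "final_destination": "B",
            "expected_arrival": "00:00",
            "expected_departure": "00:00",
            "actual_arrival": "00:00",
            "actual_departure": "00:00",
            "delay_departure": "0",
            "delay_arrival": "0",
        },
    }],
}]


def build_feed(stations):
    """Serialize stations -> trains -> data fields into one XML string."""
    root = ET.Element("stations")
    for station in stations:
        station_el = ET.SubElement(root, "station", name=station["name"])
        for train in station["trains"]:
            train_el = ET.SubElement(station_el, "train", number=train["number"])
            # One child element per data field listed in the requirements.
            for field, value in train["data"].items():
                ET.SubElement(train_el, field).text = value
    return ET.tostring(root, encoding="unicode")


if __name__ == "__main__":
    print(build_feed(SAMPLE))
```

For the 26-file variant, the same function could be called once per letter with only the stations from that index page.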
This "crawler" should update its information every 5 minutes so that I have a truly up-to-date news feed.
BE AWARE: this website has some built-in mechanisms to prevent leeching (a maximum number of pages allowed per minute, possibly IP tracking), so a solution should be found for this as well (proxy usage, caching, etc.).
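As a starting point for the caching part, here is a minimal Python sketch of a fetcher that spaces out requests and reuses pages fetched recently. The class name `PoliteFetcher` and the default of 2 seconds between requests are assumptions, since the site's actual limits are not known; the 300-second cache lifetime simply mirrors the 5-minute update interval. Proxy usage is not covered here.

```python
import time
import urllib.request


def http_get(url):
    """Plain HTTP GET returning the raw response body as bytes."""
    with urllib.request.urlopen(url) as response:
        return response.read()


class PoliteFetcher:
    """Throttle requests and cache responses (hypothetical helper)."""

    def __init__(self, fetch=http_get, min_interval=2.0, cache_ttl=300.0,
                 clock=time.monotonic, sleep=time.sleep):
        self._fetch = fetch
        self._min_interval = min_interval  # seconds between requests (assumed)
        self._cache_ttl = cache_ttl        # seconds a cached page stays fresh
        self._clock = clock
        self._sleep = sleep
        self._last_request = float("-inf")
        self._cache = {}                   # url -> (fetch time, body)

    def get(self, url):
        now = self._clock()
        cached = self._cache.get(url)
        if cached is not None and now - cached[0] < self._cache_ttl:
            return cached[1]               # fresh enough: no request sent
        wait = self._last_request + self._min_interval - now
        if wait > 0:
            self._sleep(wait)              # keep requests spaced apart
        body = self._fetch(url)
        self._last_request = self._clock()
        self._cache[url] = (self._last_request, body)
        return body
```

Passing `fetch`, `clock` and `sleep` as parameters is only a design convenience: it lets the throttling logic be tried out without touching the real website.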
Contractor | Country/Region |
---|---|
3 Evontech | |
3 Jasonterhorst | |
3 Canbayir | |
3 Akswebsolutions | |
3 Iphonedeveloper | |
3 Alxmarcelo | |
3 Djworth | |
3 Mmandal | |
3 Iprog (winning bid) | |
3 Soyatec | |
2 Nurvo | |
2 Arunp | |
2 Endeavoursoftware | |
2 Rapidvalue | |
2 Jim_manley | |
2 Glukin | |
2 Dougallbright | |