Overview
We are collecting data from a Japanese website.
However, the previous programmer has gone walkabout and we cannot contact them.
The scraper has stopped working correctly, possibly because of updates to the site, to Firefox, or to Greasemonkey.
Those are the three things the scraper depends on. It currently runs on Windows, but ideally it would also run on Linux using Firefox.
For the time being, though, we need to start collecting the data again as soon as possible.
We can connect to the machine running the scripts using TeamViewer: ID 318208345 and password web883388.
The script relies on a bundled Java server application called Data Miner Server v1.4.0, which runs from the desktop, collects the data, and puts it into the correct folders.
The server also parses the HTML into a CSV file, but we keep both the HTML and the CSV just in case.
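As a rough illustration of the HTML-to-CSV step, here is a minimal sketch in JavaScript. It is not the code used by Data Miner Server, which we have not described here. The function names escapeCsvField and rowsToCsv are hypothetical, and the sketch assumes the table cells have already been extracted into arrays of strings.

```javascript
// Hypothetical helper: quote a single CSV field when it contains
// a comma, a double quote, or a line break (RFC 4180 style).
function escapeCsvField(value) {
  const text = String(value);
  if (/[",\r\n]/.test(text)) {
    // Double any embedded quotes and wrap the whole field in quotes.
    return '"' + text.replace(/"/g, '""') + '"';
  }
  return text;
}

// Hypothetical helper: turn an array of rows (each an array of cell
// values) into CSV text, one line per row, separated by CRLF.
function rowsToCsv(rows) {
  return rows
    .map(function (row) {
      return row.map(escapeCsvField).join(',');
    })
    .join('\r\n');
}
```

Inside a userscript, the rows could be read from a table element on the page, for example with Array.from(table.rows, tr => Array.from(tr.cells, td => td.textContent.trim())), and then passed to rowsToCsv. Keeping the raw HTML alongside the CSV, as we do now, means a parsing mistake can be corrected later without losing data.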
The script itself runs inside Firefox using the Greasemonkey add-on. It is kept on the desktop and is called:
kyotei_data_miner_v.1.0.2user
The website lists races that are held throughout the day at many different venues. We need to collect
the odds for those races in real time and, finally, the results pages when they appear. The venues can change every day.
Looking through the script, you will see that it knows which venues to collect, but it has stopped collecting.
We run the script over the Tor network for some privacy. Tor is running on the machine as well as being set up in Firefox.
Races often run after 8 or 9 pm and should be running now.
We need to update the script and have it running as soon as possible.
If the machine is rebooted, Tor, the scripts, and the server should all start automatically.
The scripts should reset themselves each morning so that they collect that day's data automatically.
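One way the morning reset could be scheduled from inside a userscript is sketched below. This is an assumption about how it might be done, not a description of what the existing script does. The function name msUntilNextReset and the reset hour are hypothetical, and the calculation uses the local clock of the machine running Firefox.

```javascript
// Hypothetical helper: milliseconds from `now` until the next occurrence
// of resetHour:00 in local time (later today if that time is still ahead,
// otherwise tomorrow).
function msUntilNextReset(now, resetHour) {
  const next = new Date(
    now.getFullYear(),
    now.getMonth(),
    now.getDate(),
    resetHour, 0, 0, 0
  );
  if (next <= now) {
    // Today's reset time has already passed, so move to tomorrow.
    next.setDate(next.getDate() + 1);
  }
  return next.getTime() - now.getTime();
}
```

In the script this could be used as setTimeout(function () { location.reload(); }, msUntilNextReset(new Date(), 6)); so that the page reloads at 06:00 and the venue list for the new day is read again. The hour 6 is only an example value. If the machine is not set to Japan time, the reset hour would need adjusting to match the site's schedule.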
The website we collect from is http://www.mbrace.or.jp/od/O/Nindex.html
The previous programmer added FoxyProxy to Firefox just before leaving, and it may be needed now.