I need to sample 1 million web pages to determine their languages.
After generating random IPs, I need to know which ones belong to a web server. How can I check this?
I pinged yahoo.com and got their IP, 188.8.131.52,
but using Java:
InetAddress addr = InetAddress.getByName("184.108.40.206");
I get addr.getHostName() as "b1.www.vip.mud.yahoo.com", which is not reachable using InetAddress.isReachable(maxTimeInMsToTry).
Any help will be much appreciated.
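For what it's worth, InetAddress.isReachable typically sends an ICMP echo request or, without the needed privileges, tries a TCP connection to port 7 (echo). Many hosts drop both, so a false result does not mean there is no web server. A rough sketch of a more direct check is to attempt a TCP connection to port 80 with a timeout. The class name WebServerProbe, the method name isPortOpen and the 2000 ms timeout are made up for illustration, and an open port 80 only suggests that something is listening there, not that it serves web pages:

```java
import java.io.IOException;
import java.net.InetSocketAddress;
import java.net.Socket;

public class WebServerProbe {

    // Hypothetical helper: returns true if a TCP connection to ip:port
    // succeeds within timeoutMs, false if it is refused or times out.
    public static boolean isPortOpen(String ip, int port, int timeoutMs) {
        try (Socket socket = new Socket()) {
            socket.connect(new InetSocketAddress(ip, port), timeoutMs);
            return true;
        } catch (IOException e) {
            // Covers connection refused, timeout and unreachable host.
            return false;
        }
    }

    public static void main(String[] args) {
        if (args.length < 1) {
            System.out.println("usage: java WebServerProbe <ip>");
            return;
        }
        // Port 80 is the default HTTP port; the timeout is an arbitrary choice.
        boolean open = isPortOpen(args[0], 80, 2000);
        System.out.println(args[0] + " port 80 open: " + open);
    }
}
```

To confirm that the listener really is an HTTP server, one would still have to send a request and look at the response. Probing large numbers of random addresses also generates a lot of traffic, so rate limiting would be wise.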
I can help you with this. However, I'd like to know why you are doing this. You'll understand that this kind of activity is a very greedy use of network resources. I'd also prefer to discuss this offline with you, as I don't want to discuss it openly. You can contact me directly through my web site: http://clanmills.com.