If you need a tool that collects information over HTTP or WebSocket, I can write a custom protocol implementation on top of raw sockets. I have experience writing similar tools for analyzing the forex market; one of them used 100 MB of bandwidth across more than 2000 simultaneous connections, including quote streaming. I can also export the extracted information to a suitable format (PDF, DOC, XML, Excel, or any other), or proxy the stream to whatever service you require.
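
To give a rough idea of what a hand-written WebSocket client on raw sockets involves, here is a minimal sketch of the opening handshake from RFC 6455. It is only an illustration, not code from the forex tool mentioned above; the function names `make_key`, `expected_accept`, and `build_handshake` and the host and path in the usage note are hypothetical.

```python
import base64
import hashlib
import os

# Fixed GUID that RFC 6455 defines for the opening handshake.
WS_GUID = "258EAFA5-E914-47DA-95CA-C5AB0DC85B11"


def make_key() -> str:
    """Return a random 16-byte nonce, base64-encoded, for Sec-WebSocket-Key."""
    return base64.b64encode(os.urandom(16)).decode("ascii")


def expected_accept(key: str) -> str:
    """Compute the Sec-WebSocket-Accept value the server must send back."""
    digest = hashlib.sha1((key + WS_GUID).encode("ascii")).digest()
    return base64.b64encode(digest).decode("ascii")


def build_handshake(host: str, path: str, key: str) -> bytes:
    """Build the HTTP/1.1 upgrade request to write to a raw TCP socket."""
    lines = [
        f"GET {path} HTTP/1.1",
        f"Host: {host}",
        "Upgrade: websocket",
        "Connection: Upgrade",
        f"Sec-WebSocket-Key: {key}",
        "Sec-WebSocket-Version: 13",
        "",  # blank line that terminates the header block
        "",
    ]
    return "\r\n".join(lines).encode("ascii")
```

In use, the bytes from `build_handshake("example.com", "/quotes", make_key())` would be sent over a connected `socket.socket`, and the `Sec-WebSocket-Accept` header in the reply compared against `expected_accept(key)` before any frames are read.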