machine learning web mining crawler work flexibly data mining spider web data collection tax and assessment software ant tier one web applications business intelligence internet