Airavata Sandbox for Machine Learning, Rspark, Sage

Airavata is a Hyper-V VHD-based sandbox with the tools prebuilt and set up, so you can get started with rspark for machine learning and with SageMath.

NLTK for tagging software properties

One of the problems I set out to solve here is identifying parts of speech, specifically noun forms such as locations, organization names, addresses and version numbers. These can be used to automate cataloguing applications into a database, which is one of many possible uses. This is not an out-and-out solution, but it can help a manual cataloguer by reducing the work of going through multiple websites and searching them extensively.
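
To make the idea concrete, here is a minimal sketch of pulling name-like phrases and version numbers out of a sentence with NLTK. It is not the code used in this sandbox. It assumes a deliberately crude, rule-based setup (RegexpTagger plus RegexpParser) so that it runs without downloading any NLTK models; a real pipeline would more likely use the trained taggers behind nltk.pos_tag and nltk.ne_chunk. The sample sentence, the tag names VERSION, NNP and O, and the function extract_properties are all made up for illustration.

```python
import nltk
from nltk.tokenize import RegexpTokenizer

# Hypothetical sample text; not taken from any real catalogue.
SAMPLE = "ExampleApp 2.4.1 is published by Example Software Inc in Springfield."

# Keep dotted version numbers such as 2.4.1 together as one token.
tokenizer = RegexpTokenizer(r"\d+(?:\.\d+)+|\w+|[^\w\s]")

# Assumed rules: dotted numbers are versions, capitalized words are
# treated as proper nouns, everything else is tagged "O" (other).
tagger = nltk.RegexpTagger([
    (r"^\d+(?:\.\d+)+$", "VERSION"),
    (r"^[A-Z][a-zA-Z]*$", "NNP"),
    (r".*", "O"),
])

# Group runs of consecutive proper nouns into one NAME chunk.
chunker = nltk.RegexpParser("NAME: {<NNP>+}")


def extract_properties(text):
    """Return candidate names and version numbers found in text."""
    tagged = tagger.tag(tokenizer.tokenize(text))
    tree = chunker.parse(tagged)
    names = [
        " ".join(word for word, _tag in subtree.leaves())
        for subtree in tree.subtrees(lambda t: t.label() == "NAME")
    ]
    versions = [word for word, tag in tagged if tag == "VERSION"]
    return {"names": names, "versions": versions}


if __name__ == "__main__":
    print(extract_properties(SAMPLE))
```

The capitalization rule cannot tell an organization from a location, and it will also pick up any ordinary word that starts a sentence, so the output is best treated as a list of candidates for the cataloguer to confirm rather than a finished catalogue entry.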

