How to categorise millions of organic compounds?