The Apache Software Foundation has announced a set of new top-level projects. These include Traffic Server, Mahout, Nutch, Avro, HBase, and Tika. "Apache Tika is an embeddable, lightweight toolkit for content detection and analysis. Powered by MIME standards from IANA, advanced language detection features and the ability to rapidly unify existing parser libraries, Tika provides a one-stop shop for navigating the modern information landscape."
Read more at lwn.net (subscription may be required).





