
Analysing Research Books: A Solution to Analytics Vidhya’s Text Analysis Competition

Crystal X
4 min read · Oct 6, 2020


Analytics Vidhya is an Indian company specialising in providing courses to data science students. In the midst of the COVID-19 pandemic, on Indian Independence Day, Analytics Vidhya launched a Janatahack competition. The problem statement for this competition is as follows:

Researchers have access to large online archives of scientific articles. As a consequence, finding relevant articles has become more difficult. Tagging or topic modelling provides a way to give token of identification to research articles which facilitates recommendation and search process.

Given the abstract and title for a set of research articles, predict the topics for each article included in the test set.

Note that a research article can possibly have more than 1 topic. The research article abstracts and titles are sourced from the following 6 topics:

1. Computer Science

2. Physics

3. Mathematics

4. Statistics

5. Quantitative Biology

6. Quantitative Finance
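
Because an article can carry more than one topic, each article's target is naturally a vector of six 0/1 flags rather than a single class. The following is a minimal sketch of that encoding in plain Python; the helper names `encode_topics` and `decode_topics` are hypothetical and are not taken from the competition or from my solution.

```python
# The six topic labels listed in the problem statement, in the same order.
TOPICS = [
    "Computer Science",
    "Physics",
    "Mathematics",
    "Statistics",
    "Quantitative Biology",
    "Quantitative Finance",
]


def encode_topics(article_topics):
    """Turn a collection of topic names into a 0/1 vector, one slot per topic."""
    unknown = set(article_topics) - set(TOPICS)
    if unknown:
        raise ValueError(f"unknown topics: {sorted(unknown)}")
    return [1 if topic in article_topics else 0 for topic in TOPICS]


def decode_topics(indicator):
    """Inverse of encode_topics: recover topic names from a 0/1 vector."""
    return [topic for topic, flag in zip(TOPICS, indicator) if flag]


# Hypothetical example: an article tagged with two topics at once.
example = encode_topics({"Computer Science", "Statistics"})
print(example)  # [1, 0, 0, 1, 0, 0]
print(decode_topics(example))  # ['Computer Science', 'Statistics']
```

Framing the target this way means a model can be asked six independent yes/no questions per article, which is one common way to approach multi-label problems.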

To complete the task, I first imported the most common libraries that I would need.

