data analytics

Letting the Data Tell the Story

2017-08-03T17:10:10+00:00

In our previous post we described the technique for assigning categories to data, based on input from content experts within a “training database”.  This technique is effective for summarizing large, text-heavy data into specific categories for summaries and improved visualization.  While this approach is useful for those purposes, it will not allow us to uncover new insights or trends because we are imposing a preconceived and finite set of options, or in other words, what we already know. The following describes our approach to using clustering techniques for exploring text-heavy data. We applied this technique to two different datasets: scientific journals [...]

Letting the Data Tell the Story2017-08-03T17:10:10+00:00

USING NLP TECHNIQUES TO CLASSIFY PATIENT SEGMENTS IN CLINICAL TRIAL DATA

2017-08-02T19:24:26+00:00

One of the most common and powerful approaches in NLP provides the content experts an opportunity to label each data segment for a portion of the dataset and then analyze these labels to apply to the rest of the dataset. Some key questions need to be answered when applying this approach in different environments.   For example, how many “expert” labels do we need to create before the classification works effectively?  How can we evaluate this in advance?  Are we limiting ourselves to only extracting from the data what we already believe to be true?  We will discuss that last one in [...]

USING NLP TECHNIQUES TO CLASSIFY PATIENT SEGMENTS IN CLINICAL TRIAL DATA2017-08-02T19:24:26+00:00
Go to Top