Skip to main content

Analyze summary statistics chart data

The summary statistics charts summarizes essential aspects of lab test data, including measures such as the mean, median, mode, standard deviation, and range. This information provides a concise overview of the dataset's central tendency and variability, making it easier to interpret lab results and assess overall trends.

In EDC, the Summary Statistics chart is comprised of the boxes that represents statistical measurements of a dataset drawn vertically on the chart. The central box represents the interquartile range (IQR), with the lower and upper edges indicating the first and third quartiles, respectively. A line inside the box represents the median. Whiskers extend from the edges of the box to indicate the range of data excluding any outliers. Outliers are the values that lie outside the range of whiskers and are represented on the chart as green circles.

Vertical box plot schema
Figure 1. Vertical box plot schema

Example of summary statistics chart in EDC
Figure 2. Example of summary statistics chart in EDC

To analyze the summary statistics chart data
  1. In the EDC application header, select the DASHBOARD tab.

  2. On the page that opens, select the Lab Test Analytics tab.

    Accessing lab test analytics dashboard
    Figure 3. Accessing lab test analytics dashboard

  3. On the workspace toolbar, from the Lab Test dropdown menu, select the lab test that you want to review.

    Selecting lab test to review
    Figure 4. Selecting lab test to review

  4. On the Lab Test Analytics dashboard that opens, start analyzing the Summary Statistics chart data by hovering over the boxes and data points to view the tooltips and comparing different values for different categories of data where relevant.

    Reviewing summary statistics chart data
    Figure 5. Reviewing summary statistics chart data

  5. Now from the chart, select Detail info_icon.png. In the table that opens, view the summary statistics details as explained in the following table.

    Reviewing summary statistics details
    Figure 6. Reviewing summary statistics details

    Tip

    To enlarge the table view, from the chart, switch to the Full Screen Full_screen_icon.png mode.

    Column

    Details

    Group By - Country

    Represents the grouping variable for the analysis, which in this case, is country. This parameter categorizes the data based on different countries, allowing for comparative analysis of outcomes, behaviors, or characteristics across the geographic regions.

    Statistical Methods

    Represents the statistical techniques employed to analyze the data within each group. Common methods for the vertical box plot chart include the following:

    • Max: represents the highest observed data point in the dataset. In the box plot chart, this value is indicated by the upper whisker line and helps identify the range of the data.

    • Min: represents the lowest observed data point in the dataset. In the box plot chart, this value is indicated by the lower whisker line and helps identify the range of the data.

    • Median: represents the middle values of the dataset when it is ordered from lowest to highest. In a box plot chart, it is represented by a line within the box, dividing the box into two equal halves.

    • Mean: represents the average value of the dataset, calculated by summing all the values and dividing by the number of observations. This value provides a measure of central tendency, which can complement the median.

    • Std Dev (standard deviation): represents the amount of variation or dispersion in the dataset. This value is often referenced to understand the variability of the data.

    • N (sample size): represents the total number of observations or data points in the dataset. It is essential for understanding the statistical significance of the results presented in the box plot.

    All Visit

    Represents the total count of visits or observations included in the analysis for each group. This count is important for assessing the reliability and validity of the findings presented in the analysis.

    Screening

    Represents the statistics gathered during the screening phase of the study. It includes baseline lab test results collected before participants are enrolled in the trial.

    This data is crucial for determining eligibility and establishing the initial health status of the subject.

    C1D1 (Cycle 1, Day 1)

    Represents the lab test results collected on the first day of the first treatment cycle. This column summarizes essential statistics, such as mean, median, and other measures, reflecting the subject's lab values immediately before or after the initiation of the treatment.

    EOT (End of Treatment)

    Represents the lab test results collected at the end of the treatment phase for the study participants. This static is essential for accessing the final health status of the subjects after completing the assigned treatment regime.

    Unscheduled Visit.1 (n+1)

    Represents the statistics from the first unscheduled visit, which occurs outside of the predefined study schedule. It includes the lab test results that may have been collected due to patient-reported issues or adverse events.

    This chart may have more than one unscheduled visits. These visits reflect additional lab test results that are collected outside the regular visit schedule, aiding in the assessment of any changes in the subject's health status as the trial progresses.

Once you have analyzed the details of the Summary Statistics chart, select Detail_icon_enabled.png to close the table and return to the chart view.