Noncompartmental Analysis

Introduction

Non-compartmental analysis (NCA) is a simple and quick method for evaluating the exposure of a drug. It allows you to evaluate things like linearity and in vivo exposure. To illustrate this, consider an antibody given as a subcutaneous injection. The actual profile a patient might experience is given by the solid black line. But we don’t yet have the ability to sample in a continuous fashion. What we might observe is given by the blue points.

Generally NCA will determine the following directly from the data:

Cmax - Maximum observed concentration (units=concentration)
Tmax - The time where the maximum concentration was observed (units=time)
AUC - The area under the curve ( $units=time \times concentration$ )
AUMC - The area under the first moment curve ( $units=time^2 \times concentration$ )

These properties are all based on observational data. So the Cmax and Tmax will almost certainly not be at the actual maximum concentration, but as long as we sample judiciously they will give us a good approximation. Similarly, the calculated AUC and AUMC will differ from the actual values. To calculate the areas you need to dig back into your calculus textbooks for the trapezoid method. Basically, each sampling interval is treated as a trapezoid; the area of each is calculated and these areas are added up over all n samples:

$AUC = \int_0^{t_f} Cdt \approx \sum_{i=1}^{n-1}{\frac{C_i+C_{i+1}}{2}\times (t_{i+1}-t_{i})}, \ \ \ \ \ AUMC = \int_0^{t_f} t\times Cdt \approx \sum_{i=1}^{n-1}{\frac{t_iC_i+t_{i+1}C_{i+1}}{2}\times (t_{i+1}-t_{i})}$
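As a rough illustration of the sums above, the following R sketch computes Cmax, Tmax, AUC and AUMC with the linear trapezoid rule. The helper name trapz is made up for this example and is not part of ubiquity or PKNCA. The observations are those of the first subject in the single dose example dataset used later in this vignette, and the areas only cover the span from the first to the last sample (no extrapolation to time zero or to infinity).

```r
# Observations for one subject (first subject of the single dose example data)
time_hr <- c(1, 4, 8, 24, 72, 168, 336, 504, 672)
conc    <- c(9953.88, 9704.51, 9383.72, 8223.55, 5685.31,
             3118.78, 1673.89, 1215.22, 964.64)

# Hypothetical helper: linear trapezoid rule, i.e. the sum over all
# intervals of (mean of the two end values) x (interval width)
trapz <- function(x, y) {
  sum((head(y, -1) + tail(y, -1)) / 2 * diff(x))
}

cmax <- max(conc)                       # maximum observed concentration
tmax <- time_hr[which.max(conc)]        # time of that observation
auc  <- trapz(time_hr, conc)            # area from first to last sample
aumc <- trapz(time_hr, time_hr * conc)  # area under the first moment curve

print(c(Cmax = cmax, Tmax = tmax, AUC = auc, AUMC = aumc))
```

Dedicated software such as PKNCA offers other integration rules as well, so values from this sketch are only meant to show the arithmetic.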

This can be done in Excel pretty easily. Depending on the data and the analysis other properties can be calculated. For example we can calculate the clearance, steady-state volume of distribution and terminal half-life:

Clearance: $CL = \frac{Dose}{AUC}$
Mean residence time: $MRT = \frac{AUMC}{AUC}$
Steady state volume of distribution: $V_{ss} = MRT \times CL$
Half-life: $t_{1/2} = \frac{\ln(2)}{\lambda_z}$, where $-\lambda_z$ is the terminal slope of the natural log of the data
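The relationships in this list can be sketched in a few lines of R. This is a minimal example under stated assumptions: the data are those of the first subject in the single dose example dataset used later in this vignette, the areas are not extrapolated, the helper trapz is hypothetical, and the terminal phase is fixed by hand to the last three samples (lambda_z denotes the negative of the terminal log-linear slope). Because of these simplifications the clearance and volume will not match the values reported by PKNCA.

```r
time_hr <- c(1, 4, 8, 24, 72, 168, 336, 504, 672)
conc    <- c(9953.88, 9704.51, 9383.72, 8223.55, 5685.31,
             3118.78, 1673.89, 1215.22, 964.64)
dose_ng <- 30 * 1e6   # 30 mg dose expressed in ng to match ng/ml

# Hypothetical helper: linear trapezoid rule
trapz <- function(x, y) sum((head(y, -1) + tail(y, -1)) / 2 * diff(x))

auc  <- trapz(time_hr, conc)            # hr * ng/ml (observed span only)
aumc <- trapz(time_hr, time_hr * conc)  # hr^2 * ng/ml

cl  <- dose_ng / auc   # clearance, ml/hr
mrt <- aumc / auc      # mean residence time, hr
vss <- mrt * cl        # steady-state volume of distribution, ml

# Terminal slope from a log-linear fit of the last n_term samples
# (assumption: n_term is chosen by hand here)
n_term   <- 3
idx      <- tail(seq_along(time_hr), n_term)
fit      <- lm(log(conc[idx]) ~ time_hr[idx])
lambda_z <- -unname(coef(fit)[2])
thalf    <- log(2) / lambda_z   # terminal half-life, hr

print(c(CL = cl, MRT = mrt, Vss = vss, halflife = thalf))
```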

Properties like AUC and AUMC can also be calculated using extrapolation from the last time point to infinity to account for exposure beyond the observations at hand. The subsequent values of clearance, volume of distribution, etc. can also change with extrapolation.
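A common way to carry out this extrapolation is to assume that the concentration keeps declining mono-exponentially after the last sample, with rate lambda_z taken from the terminal slope. The tail areas then have closed forms, sketched below in R. The function names are invented for this example; PKNCA provides its own implementation.

```r
# Hypothetical helpers: add the area of a mono-exponential tail that starts
# at the last sample (t_last, c_last) and decays with rate lambda_z
auc_to_inf <- function(auc_last, c_last, lambda_z) {
  auc_last + c_last / lambda_z
}
aumc_to_inf <- function(aumc_last, t_last, c_last, lambda_z) {
  aumc_last + t_last * c_last / lambda_z + c_last / lambda_z^2
}

# Check against a profile with a known answer: C(t) = exp(-k * t),
# for which AUC(0-inf) = 1/k and AUMC(0-inf) = 1/k^2
k     <- 0.1
t_obs <- seq(0, 30, by = 0.01)
c_obs <- exp(-k * t_obs)
trapz <- function(x, y) sum((head(y, -1) + tail(y, -1)) / 2 * diff(x))

auc_inf  <- auc_to_inf(trapz(t_obs, c_obs), tail(c_obs, 1), k)
aumc_inf <- aumc_to_inf(trapz(t_obs, t_obs * c_obs),
                        tail(t_obs, 1), tail(c_obs, 1), k)
print(c(AUCinf = auc_inf, AUMCinf = aumc_inf))
```

Using the observed last concentration for c_last corresponds to the "obs" flavour of the extrapolated parameters, while using the value predicted by the terminal regression corresponds to the "pred" flavour.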

There is a lot of nuance associated with these calculations, and it is good to rely on software that focuses on this type of analysis. The PKNCA package has been developed with this in mind. Ubiquity provides a set of functions to automate NCA and reporting for preclinical data. These functions act as a wrapper for the PKNCA package, which does most of the heavy lifting. Only a small subset of the PKNCA functionality is used here. If more extensive analysis is necessary, PKNCA can be used directly.

This vignette contains a series of examples on how to perform NCA in ubiquity. To make a copy of these scripts and other supporting files in the current working directory run the following:

library(ubiquity)
fr = workshop_fetch(section="NCA", overwrite=TRUE)

This creates several files in the working directory. First are data sets:

pk_all_sd.csv PK data for multiple subjects dosed either IV or SC at different levels
pk_all_md.csv Multiple dose data with intensive sampling on the first and last dose
pk_sparse_sd.csv Single dose data with sparse sampling

Next the following scripts demonstrate how to perform NCA:

analysis_nca_sd.R Single dose data for individuals
analysis_nca_md.R Multiple dose data for individuals
analysis_nca_sparse.R Average NCA when analyzing sparsely sampled data

Quick Template for Running NCA

You should read the rest of the vignette below to understand the required data format and how the functions work. But if you are returning to this and you just want a template for running NCA, you can use the following:

library(ubiquity)
cfg = build_system()
fr = system_fetch_template(cfg, template = "NCA")

This will create the file analysis_nca.R in the current working directory. You can just uncomment/edit that script to get started.

Single Dose Data

This example follows the script analysis_nca_sd.R, and uses the dataset pk_all_sd.csv.

Expected Format of Data

First we build the system and load the dataset.

cfg = build_system(system_file="system.txt")
cfg = system_load_data(cfg, dsname     = "PKDATA", 
                            data_file  = "pk_all_sd.csv")

The NCA functions in ubiquity expect data to have a certain format. Some columns are required, others are optional or depend on the kind of analysis being performed, and additional columns may also be present. In this context consider the following dataset:

	ID	TIME_HR	C_ng_ml	DOSENUM	DOSE	ROUTE
1	1	1.00	9953.88	1	30	iv bolus
2	1	4.00	9704.51	1	30	iv bolus
3	1	8.00	9383.72	1	30	iv bolus
4	1	24.00	8223.55	1	30	iv bolus
5	1	72.00	5685.31	1	30	iv bolus
6	1	168.00	3118.78	1	30	iv bolus
7	1	336.00	1673.89	1	30	iv bolus
8	1	504.00	1215.22	1	30	iv bolus
9	1	672.00	964.64	1	30	iv bolus


The required columns, with their names in this dataset provided in parentheses, are:

ID Unique subject identifier (ID)
TIME Time since the first dose (TIME_HR)
NTIME Time since the last dose (since we are dealing with single dose data this is also TIME_HR)
CONC Observed concentration for this record (C_ng_ml in ng/ml)
DOSE Dose given (DOSE in mg)
ROUTE Route of administration (ROUTE); can be either: iv bolus, iv infusion or extra-vascular

Optional columns include:

DOSENUM When analyzing multiple dose data (example below), this column will be used to associate records with doses
BACKEXTRAP Back-extrapolation of IV data can be done generally for the entire dataset or this column can be used to specify the number of points to use on an individual basis
SPARSEGROUP Grouping for sparse sampling data where you want to average data at each time point
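Before running an analysis it can help to confirm that a dataset covers the required columns. The R sketch below is a hypothetical pre-check and not a ubiquity function: it builds a small data frame with the column names of the example dataset, defines a mapping from the required names to those columns, and reports anything that is missing or any route value outside the three listed above.

```r
# Small stand-in for the example dataset (first rows of one IV subject)
pk <- data.frame(
  ID      = c(1, 1, 1),
  TIME_HR = c(1, 4, 8),
  C_ng_ml = c(9953.88, 9704.51, 9383.72),
  DOSENUM = c(1, 1, 1),
  DOSE    = c(30, 30, 30),
  ROUTE   = c("iv bolus", "iv bolus", "iv bolus")
)

# Mapping from the names used by the analysis to the dataset columns
dsmap <- list(TIME  = "TIME_HR", NTIME = "TIME_HR", CONC = "C_ng_ml",
              DOSE  = "DOSE",    ROUTE = "ROUTE",   ID   = "ID")

required     <- c("ID", "TIME", "NTIME", "CONC", "DOSE", "ROUTE")
valid_routes <- c("iv bolus", "iv infusion", "extra-vascular")

missing_keys <- setdiff(required, names(dsmap))       # required names not mapped
missing_cols <- setdiff(unlist(dsmap), names(pk))     # mapped columns not in data
bad_routes   <- setdiff(unique(pk$ROUTE), valid_routes)

if (length(c(missing_keys, missing_cols, bad_routes)) > 0) {
  stop("Dataset does not match the expected NCA format")
}
```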

NCA and Outputs

Next we perform the NCA using system_nca_run:

cfg = system_nca_run(cfg, dsname        = "PKDATA", 
                           dscale        = 1e6, 
                           analysis_name = "pk_single_dose", 
                           extrap_C0     = FALSE, 
                           dsmap         = list(TIME    = "TIME_HR", 
                                                NTIME   = "TIME_HR", 
                                                CONC    = "C_ng_ml", 
                                                DOSE    = "DOSE",
                                                ROUTE   = "ROUTE", 
                                                ID      = "ID"),
                           digits        = 3)

We link the analysis to the dataset by specifying the dataset name (dsname) used when we loaded the dataset. The dosing in the dataset is in mg (i.e. a 30 mg dose), but the mass units of the concentration values are in ng (i.e. ng/ml). The dscale input converts the mass units of the dose to the mass units of the observed concentration. The analysis_name ¹ is used both to refer to this analysis in the reporting functions and to name the files generated by the analysis. By default the initial concentration (nominal time zero) will be back-extrapolated; this can be disabled by setting extrap_C0 to FALSE. Next we map the columns in the dataset to the names used by the analysis (dsmap). Note that the actual time (TIME) and nominal time (NTIME) both use the same column in the dataset (TIME_HR). Lastly, the digits input defines the rounding applied to the reported values.
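The role of dscale is plain unit arithmetic, shown here as a short R sketch. The variable names are chosen for this example only.

```r
dose_mg <- 30    # dose as recorded in the dataset (mg)
dscale  <- 1e6   # ng per mg, because concentrations are in ng/ml

# Dose expressed in the mass units of the concentration data (ng)
dose_cu <- dose_mg * dscale
print(dose_cu)
```

The result, 3e7 ng, is the value that appears in the Dose_CU column of the summary output. With the same reasoning, a dose in mg paired with concentrations in ug/ml would call for a scale of 1e3.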

Running this function will produce the following files:

output/pk_single_dose-nca_summary-pknca.csv Summary level output from the analysis (see table below)
output/pk_single_dose-nca_data.RData R objects used for downstream reporting
output/pk_single_dose-pknca_raw.csv Raw output from PKNCA

	ID	Nobs	Dose_Number	Dose	Dose_CU	Cmax	Tmax	halflife	Vp_obs	Vss_obs	Vss_pred	C0
1	1	9	1	30	30000000.00	9950	1	423	3010	6070	6060	-1
2	2	9	1	30	30000000.00	11300	1	477	2660	5060	5060	-1
3	3	9	1	30	30000000.00	7420	1	500	4040	5520	5510	-1
4	4	9	1	30	30000000.00	5760	1	589	5210	8430	8430	-1
5	5	9	1	30	30000000.00	6460	1	574	4650	7300	7300	-1
6	6	9	1	30	30000000.00	9460	1	526	3170	5410	5410	-1
7	7	9	1	30	30000000.00	13800	1	627	2170	4380	4380	-1
8	8	9	1	30	30000000.00	7160	1	567	4190	6910	6910	-1
9	9	9	1	30	30000000.00	9180	1	561	3270	6120	6120	-1
10	10	9	1	30	30000000.00	4760	1	500	6300	10200	10200	-1


The following columns are provided in the output. When a value is calculated by PKNCA, the name in parentheses is the one used by PKNCA:

ID Subject ID for when sparse = FALSE and group ID when sparse = TRUE
Nobs Number of observations used for calculation
Dose_Number Current dose number
Dose Dose (units from the dataset)
Dose_CU Dose (mass in units from concentration data)
Cmax Maximum observed concentration
Tmax Time of Cmax
halflife Terminal half-life (half.life)
Vp_obs When the dosing route is “iv bolus”, this is the volume calculated by dividing the dose in concentration units by the first observed concentration, or by C0 if extrapolation is selected. For other routes a value of -1 will be returned.
Vss_obs Steady-state volume of distribution based on observation data (vss.obs)
Vss_pred Steady-state volume of distribution with $C_{last,pred}$ as the final observation (vss.pred)
C0 Back-extrapolated initial concentration
CL_obs Clearance based on observation data (cl.obs)
CL_pred Clearance calculated using $C_{last,pred}$ as the final observation (cl.pred)
AUClast Area under the curve based on observation data (auclast)
AUCinf_obs Area under the curve extrapolated to infinity using observed data (aucinf.obs)
AUCinf_pred Area under the curve extrapolated to infinity using $C_{last,pred}$ as the final observation (aucinf.pred)

Where $C_{last,pred}$ is the predicted concentration at the final sample time by regression of the terminal phase of data.
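Two of these columns are easy to sanity check by hand. The R sketch below recomputes Vp_obs for the first subject of the single dose example from the dose in concentration units and the first observed concentration, and then shows one possible way to turn the -1 placeholder into NA before further processing. The small data frame is typed in from the first two rows of the summary table above, and the recoding step is a suggestion rather than something ubiquity does for you.

```r
# Vp_obs for an "iv bolus" subject without C0 extrapolation
dose_cu <- 30 * 1e6    # ng
c_first <- 9953.88     # first observed concentration, ng/ml
vp_obs  <- dose_cu / c_first   # ml
print(signif(vp_obs, 3))       # 3010, as in the summary table

# First two rows of the summary table, selected columns only
nca_summary <- data.frame(ID     = c(1, 2),
                          Vp_obs = c(3010, 2660),
                          C0     = c(-1, -1))

# Recode the "not calculated" placeholder (-1) as NA so it cannot be
# mistaken for a real value in downstream summaries
for (col in c("Vp_obs", "C0")) {
  nca_summary[[col]][nca_summary[[col]] == -1] <- NA
}
print(nca_summary)
```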

Automated Reporting

PowerPoint

Once the NCA has been run, the results can be appended to an open PowerPoint report. Here we initialize an empty report then use the function system_report_nca and the analysis name assigned to the analysis above (pk_single_dose) to attach those results. Then we can write the file:

cfg = system_report_init(cfg, rpttype="PowerPoint")
cfg = system_report_slide_title(cfg, title = "NCA of Single Dose PK")
cfg = system_report_nca(cfg, analysis_name = "pk_single_dose")
system_report_save(cfg, output_file=file.path("output", "pk_single_dose-report.pptx"))

For each dose a summary slide will be produced and a full timecourse will be created showing all of the data for that subject/group in the dataset:

Actual data in grey
Data used for NCA in green
Initial extrapolated concentrations in orange (solid)
Points used for extrapolating “iv bolus” data are shown in orange open circles

The slides below show the output generated for an individual subject and the first set of summary slides for the analysis. Notice that because extrap_C0 was set to FALSE, C0 was not calculated (-1). Because this individual received an SC dose, the estimate for Vp is also not calculated (-1). Also note that the data for this individual did not allow for extrapolation of AUC to infinity (NA). As a result, the parameters that depend on this value are also reported as NA. The timecourse figure shows the data in the dataset (grey closed symbols, solid line) and the data used for NCA (green open symbols, dashed line). This way you can visually confirm at the subject level which data were used.

Reporting for single dose NCA

Word

Similarly, a Word report can be generated by appending the results to an already initialized Word report. This is done by setting rpttype to "Word" when calling system_report_init. This will attach the same content to the report as with the PowerPoint report above.

cfg = system_report_init(cfg, rpttype="Word")
cfg = system_report_nca(cfg, analysis_name = "pk_single_dose")
system_report_save(cfg=cfg, output_file=file.path("output", "pk_single_dose-report.docx"))

For more information on integrated report generation see the Reporting vignette.

Multiple Dose Data

This example follows the script analysis_nca_md.R, and uses the dataset pk_all_md.csv. If we rebuild the system and load the multiple dose dataset, we can see that it looks almost identical to the single dose dataset. The primary difference is that there are two extra columns: NTIME_HR and EXTRAP. The first column (NTIME_HR) contains the time since the last dose. The EXTRAP column is optional and allows the user to specify the number of points used to back-extrapolate C0 for iv bolus dosing. This will have no effect unless it is specified in the dsmap below. If you scroll through the data you can see that there is intensive sampling for DOSENUM 1 and 6, but for doses 2-5 there are only three samples per interval. These latter dose intervals (2-5) will be ignored with the default value of NCA_min (4), which defines the minimum number of samples required for analysis.

cfg = build_system(system_file="system.txt")
cfg = system_load_data(cfg, dsname     = "PKDATA", 
                            data_file  = "pk_all_md.csv")

	ID	TIME_HR	C_ng_ml	DOSENUM	DOSE	ROUTE	NTIME_HR	EXTRAP
1	1	1.00	9955.06	1	30	iv bolus	1.00	4
2	1	4.00	9705.64	1	30	iv bolus	4.00	4
3	1	8.00	9385.61	1	30	iv bolus	8.00	4
4	1	24.00	8223.38	1	30	iv bolus	24.00	4
5	1	72.00	5748.75	1	30	iv bolus	72.00	4
6	1	168.00	3126.79	1	30	iv bolus	168.00	4
7	1	336.00	1677.76	1	30	iv bolus	336.00	4
8	1	504.00	1215.61	1	30	iv bolus	504.00	4
9	1	672.00	964.64	1	30	iv bolus	672.00	4
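To see ahead of time which dosing intervals have enough samples, you can count the records per subject and dose number and compare the counts with the NCA_min threshold. The R sketch below uses a made-up data frame (one subject, nine samples after dose 1 and three after dose 2) and assumes that an interval is analyzed when its count is at least NCA_min.

```r
# Hypothetical records: 9 samples after dose 1, 3 samples after dose 2
pk <- data.frame(ID      = 1,
                 DOSENUM = c(rep(1, 9), rep(2, 3)))

NCA_min <- 4   # minimum number of samples required for analysis

# Count the records for each subject and dose number
counts <- aggregate(n ~ ID + DOSENUM,
                    data = transform(pk, n = 1),
                    FUN  = sum)

# Assumption: intervals with at least NCA_min samples are analyzed
counts$analyzed <- counts$n >= NCA_min
print(counts)
```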


Next we perform the NCA and reporting as before. The only difference here is that we’ve removed the extrap_C0 option. The default value is TRUE, so back-extrapolation will occur. For IV dosing this is a log-linear extrapolation to a nominal time of zero. For SC dosing C0 will be 0 for the first dose, and the last observation from the previous dosing interval for subsequent doses.

cfg = system_nca_run(cfg, dsname        = "PKDATA", 
                          dscale        = 1e6, 
                          analysis_name = "pk_multiple_dose", 
                          dsmap         = list(TIME    = "TIME_HR", 
                                               NTIME   = "NTIME_HR", 
                                               CONC    = "C_ng_ml", 
                                               DOSE    = "DOSE",
                                               ROUTE   = "ROUTE", 
                                               ID      = "ID",
                                               DOSENUM = "DOSENUM",
                                               BACKEXTRAP = "EXTRAP"),  # BACKEXTRAP is the optional column name listed above
                          digits        = 3)
cfg = system_report_init(cfg)
cfg = system_report_slide_title(cfg, title = "NCA of Multiple Dose PK")
cfg = system_report_nca(cfg, analysis_name = "pk_multiple_dose")
system_report_save(cfg, output_file=file.path("output", "pk_multiple_dose-report.pptx"))
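The log-linear back-extrapolation described above for IV dosing can be sketched in R as a straight-line fit to the natural log of the first few concentrations, evaluated at time zero. The example uses the first four observations of the subject shown in the table above, matching the value of 4 in the EXTRAP column. It is only meant to show the idea: the exact procedure used by ubiquity may differ, so the result need not reproduce the C0 in the summary output.

```r
# First four observations after the first iv bolus dose
time_hr <- c(1, 4, 8, 24)
conc    <- c(9955.06, 9705.64, 9385.61, 8223.38)

# Fit log(concentration) against time and read off the intercept,
# which is the predicted log concentration at time zero
fit <- lm(log(conc) ~ time_hr)
c0  <- exp(unname(coef(fit)[1]))
print(c0)
```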

The same files are generated with the pk_multiple_dose prefix in the output folder. The summary can be seen here:

	ID	Nobs	Dose_Number	Dose	Dose_CU	Cmax	Tmax	halflife	Vp_obs	Vss_obs	Vss_pred	C0
1	1	9	1	30	30000000.00	9960	1	421	3010	6000	5980	10039.61
2	2	9	1	30	30000000.00	11300	1	476	2650	5020	5020	11648.53
3	3	9	1	30	30000000.00	7420	1	499	4040	5490	5490	7446.80
4	4	9	1	30	30000000.00	5780	1	591	5190	8410	8410	5955.48
5	5	9	1	30	30000000.00	6490	1	572	4630	7270	7270	6705.99
6	6	9	1	30	30000000.00	9470	1	526	3170	5380	5380	9654.34
7	7	9	1	30	30000000.00	13800	1	627	2170	4350	4350	14129.13
8	8	9	1	30	30000000.00	7160	1	567	4190	6870	6870	7223.51
9	9	9	1	30	30000000.00	9180	1	560	3270	6070	6070	9258.60


Below you can see the report slides for two subjects. The first subject was dosed IV. Again the grey markers/solid line show the full timecourse for that subject from the dataset. The data used for NCA (doses 1 and 6) are shown in green. The solid orange marker shows the extrapolated C0. If you look closely, the open orange markers show the data points used for extrapolation, connected with an orange dashed line. The second subject was given SC doses. For the first dose the extrapolated C0 was zero. This can be seen by the green dashed line extending down to zero. The C0 for dose 6 is simply the last observation carried forward, as shown by the solid orange marker.

Reporting for multiple dose NCA

Sparse Sampling

This example follows the script analysis_nca_sparse.R, and uses the dataset pk_sparse_sd.csv. First we rebuild the system and load the sparsely sampled dataset. This dataset is very similar to the single dose dataset above, except that each ID only has three samples.

cfg = build_system(system_file="system.txt")
cfg = system_load_data(cfg, dsname     = "PKDATA", 
                            data_file  = "pk_sparse_sd.csv")

	ID	TIME_HR	C_ng_ml	DOSENUM	DOSE	ROUTE
1	1	1.00	9953.88	1	30	iv bolus
2	1	24.00	8223.55	1	30	iv bolus
3	1	336.00	1673.89	1	30	iv bolus
4	2	4.00	10368.30	1	30	iv bolus
5	2	72.00	5145.76	1	30	iv bolus
6	2	504.00	2692.99	1	30	iv bolus
7	3	8.00	7221.01	1	30	iv bolus
8	3	168.00	4292.95	1	30	iv bolus
9	3	672.00	1851.79	1	30	iv bolus


When we run the NCA we need to tell the function that we want to perform a sparse analysis. This is done by setting the sparse input to TRUE. We also need to provide information on how to group cohorts. This is done by providing a SPARSEGROUP option in the dsmap. In this case we can just use the DOSE column, however a different column could have been used.

cfg = system_nca_run(cfg, dsname        = "PKDATA", 
                          dscale        = 1e6, 
                          analysis_name = "pk_sparse",
                          sparse        = TRUE,
                          dsmap         = list(TIME        = "TIME_HR", 
                                               NTIME       = "TIME_HR", 
                                               CONC        = "C_ng_ml", 
                                               DOSE        = "DOSE",
                                               ROUTE       = "ROUTE", 
                                               ID          = "ID",
                                               SPARSEGROUP = "DOSE"),
                          digits        = 3)
            

cfg = system_report_init(cfg)
cfg = system_report_slide_title(cfg, title = "NCA of Sparsely Sampled PK")
cfg = system_report_nca(cfg, analysis_name = "pk_sparse")
system_report_save(cfg=cfg, output_file=file.path("output", "pk_sparse-report.pptx"))

Analysis of sparse data will calculate an average concentration at each time point and use those average values for NCA. The same files are generated from the analysis. You can see the summary here:

	ID	Nobs	Dose_Number	Dose	Dose_CU	Cmax	Tmax	halflife	Vp_obs	Vss_obs	Vss_pred	C0
1	1	9	1	30	30000000.00	8430	8	1050	3730	8160	8170	8386.78
2	2	9	1	120	120000000.00	13500	168	1050	-1	11400	11400	0.00
3	3	9	1	300	300000000.00	84300	8	1050	3730	8160	8170	83867.84
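The averaging step behind a sparse analysis can be illustrated with base R. The data frame below is made up for this example (six animals in one dose group, two sampled at each time point) and is not taken from pk_sparse_sd.csv. The mean concentration is computed per group and time point, giving one composite profile per group that can then go through the usual NCA calculations.

```r
# Hypothetical sparse data: each animal contributes a single sample
pk <- data.frame(
  ID      = 1:6,
  TIME_HR = c(1, 1, 24, 24, 336, 336),
  C_ng_ml = c(10000, 9000, 8000, 8400, 1700, 1500),
  DOSE    = 30
)

# Mean concentration per sparse group (here the dose level) and time point
avg <- aggregate(C_ng_ml ~ DOSE + TIME_HR, data = pk, FUN = mean)
print(avg)
```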


To confirm what was done and identify any outliers that may be causing problems, you can use the report output. Again the grey markers show the data, the green dashed line will show the value used for NCA, and the orange will show the C0 estimate.

Reporting for sparse sampling NCA

¹ Analysis names must start with a letter and contain only letters, numbers, and underscores (_).