Data Submission
Question | Teams Collective Response |
---|---|
When is it a good idea to submit an eCTD or standardised data sample to the FDA, since ‘they are not reviewed by FDA reviewers at any time’, per FDA TCG section 7.3 [eCTD Sample Submission] (https://www.fda.gov/media/153632/download)? Here are some possible situations:
| PHUSE Team : 15 August 2024 The purpose of submitting a standardised data sample to the FDA is to determine if the gateway submission is successful. As per Submit an eCTD or Standardized Data Sample to FDA | FDA – the document that is footnoted in FDA TCG section 7.3 [eCTD Sample Submission] (https://www.fda.gov/media/153632/download) – it is indicated that: ‘The validation of sample submissions does not involve scientific review of the content and is only intended to address conformance to FDA supported electronic submission and data standards.’ It is probably best to interpret the TCG statement of they are not reviewed by FDA reviewers at any time’ as referring to ‘scientific review of content’, and, without confirming with the FDA directly, a good faith understanding that the FDA could possibly review from the perspective of ‘conformance to FDA supported electronic submission and data standards’. |
How do I interpret the FDA Study Data Technical Conformance Guide section 3.3.2 (Dataset Size) with respect to documentation of submitted datasets in Define-XML? Should I only consider documenting the non-split datasets/domain in the Define-XML; that is, only documenting the datasets present in the datasets folder in Define-XML, while the split datasets that are submitted under the ‘split’ sub-folder should only be explained in the Data Reviewer’s Guide? | PHUSE Team : 29 August 2023 There are generally two situations for splitting datasets: because of dataset size constraint, or for illustrative purposes. When splitting for dataset size constraint, the split is done after the creation of the whole dataset. Section 7.2 of the Study Data Technical Conformance Guide states: “If you need to split a file that exceeds file size limits (see section 3.3.2), you should submit the smaller split files in the ‘split’ sub-folder in addition to the larger non-split file in the original data folder. There is no need for a second Define-XML file to be submitted within the split subfolder.” There is no need to document the split datasets in Define-XML for submission. These split datasets only need to be documented in the Data Reviewer’s Guide. When splitting for illustrative purposes, such as the FA or QS domain, the FDA’s eData team recommends submitting the split datasets without the inclusion of the whole dataset. Document only the split datasets in the Define-XML. It is recommended to discuss this illustrative split with the regulatory agency prior to submission. Note that the SDTM Metadata Submission Guidance discusses this as well: “The split datasets do not require additional Define-XML documentation.” The ADaM Metadata Submission Guidance does not mention split. |
If a sponsor has to submit additional datasets (e.g. ADaM) to the already submitted study to the FDA, should the sponsor submit Define-XML that includes all the datasets (new and old) or a Define-XML with just the new datasets? Will these new datasets be placed in the same folder as the original datasets? Or will they reside in a separate folder? Should the Define file be called the same name (i.e. Define-XML) to avoid conflict with the original Define file with old datasets if only new datasets are submitted later? | PHUSE Team Response : 10 June 2023 The Study Data Technical Conformance Guide (TCG) [https://www.fda.gov/regulatory-information/search-fda-guidance-documents/study-data-technical-conformance-guide-technical-specifications-document], currently v5.2, updated in May 2023, has two relevant sections that assist in addressing this question. Section 4.1.1.2 (SDTM General Considerations) indicates the following: Each submitted SDTM dataset should have its contents described with complete metadata in the Define-XML file (see section 4.1.4.5) and within the cSDRG as appropriate (see section 2.2). When updated datasets (e.g. ‘ae.xpt’, ‘lb.xpt’) are submitted, updated and complete Define-XML and cSDRG covering all datasets should be submitted using the “replace” life cycle operator to update the original file. Section 4.1.2.2 (ADaM General Considerations) indicates the following: Each submitted ADaM dataset should have its contents described with complete metadata in the Define-XML file (see section 4.1.4.5) and within the ADRG as appropriate (see section 2.3). Unlike SDTM, where it is required to always maintain the same SDTM dataset names, ADaM datasets that are resubmitted can have new dataset names that are different from the original submission. Regarding the Define-XML specifically, the TCG wording can be interpreted at least two ways: 1) Each time new information is submitted, a new Define-XML is needed. 2) Only the first Define-XML is considered in the submission, so anything further requested is just supplementing what is already there. Sponsor experience with ADaM dataset resubmission has varied. In some cases, after ADaM datasets were initially accepted by the FDA and then more data was requested, a separate Define-XML, along with the new ADaM datasets, was provided. In the experience of some sponsors, new ADaM datasets have been provided to the FDA with or without the Define-XML dependent on the time frame and the speed of the response. If ADaM datasets have been accepted by the FDA and more data is requested, a Define-XML may be prepared with new data only in a separate directory. The Define-XML may not need to be renamed. Sometimes the same ADaM dataset names can be used. If there are challenges in set-up or other considerations, different ADaM dataset names can be used as long as they are clearly explained in the response letter. Sometimes the FDA accepts additional ADaM datasets without a Define-XML, but this is an exception. When resubmitting subsets of ADaM data, the successful experience of some sponsors is to only include updated datasets in Define-XML to avoid broken links. There is not a one-size-fits-all solution for ADaM dataset resubmission since it depends on the circumstances, so discussion with the FDA review division is essential. The location of resubmissions within m5 of the eCTD is the responsibility of the regulatory publishing team members. |
Do sponsors who are considering submissions for oncology studies under RTOR (Real Time Oncology Review) need to adhere to the OOD Safety Data Specifications that are mentioned in this RTOR guidance? It seems that these specifications were still under review by CDISC. What is the current status of the review? Even if it is under review, what is the FDA’s position on adhering to these specifications for upcoming submissions under RTOR? | PHUSE Team Response : 25 April 2023 As per the FDA (email dated 11 April 2023), OCE/OOD safety data specifications are meant to serve as a good practice recommendation and are not considered mandatory at this time. However, the FDA has sought feedback and is planning to release an updated version shortly. Flexibility is included for sponsors on how the ADaM datasets are submitted under RTOR while still following ADaMIG v1.1 CDISC guidance. For formal advice on RTOR data submission for a study under consideration, the sponsor organisation should submit the questions in a meeting package to the corresponding review division. |
How should a sponsor handle requests from the FDA related to the creation of a table containing outputs in the pivotal trials of a submission, including hyperlinks to the code used to create the table and the primary ADaMs used? (See the example below.) This request is a) different from the ARM deliverable embedded in the Define-XML that we create as part of a submission and b) not specified in any FDA guidance to date. | PHUSE Team Response: 28 February 2023 While this has not been commonly experienced, the response and method of providing the information may be specific to when it has been requested by the regulatory agency. If the request is made by the regulatory agency ahead of the submission, the sponsor may utilise the Analysis Data Reviewer's Guide (ADRG) section 7.2 and include the additional columns and hyperlinks for Input Data and Program File. For requests by the regulatory agency after the submission, the sponsor may consider providing a document containing the list of program files, the output number and output title produced by the program file, and a description of each where necessary. Refer to the Study Data Technical Conformance Guide for a list of valid file types that can be included in the submission to the FDA. Sponsors should discuss the need to provide executable code with the regulatory agency ahead of providing the program files. |
When you submit a custom ADaM dataset like ADAE2 (in addition to ADAE), how do you validate it prior to your submission? | PHUSE Team Response: 25 August 2022 Sponsors can include multiple ADAE datasets in a study as there is no guidance dissuading the use of multiple analysis datasets for adverse events. The use of multiple ADAE datasets will also not result in any Pinnacle 21 finding. However, it is possible for a reviewer from a given regulatory agency to expect a single ADAE dataset, in which case the sponsor may be required to prepare an overall dataset. One method of combining all adverse events into a single dataset may be the use of ACATy or the use of multiple TEAE flags corresponding to each treatment drug in addition to the primary TEAE flag (refer to the ADaM Structure for Occurrence Data (OCCDS) Implementation Guide v1.1). Additionally, AOCCFL can be utilised in combination with TRTEMFL, and TRTEMwFL and AOCCzzFL can be created for the drug w. |
What kind of information about a subject with multiple screenings needs to be submitted to the FDA? The FDA Study Data Technical Conformance Guide (v4.8.1, October 2021) includes the following description regarding a subject with multiple screenings: 4.1.1.2: “Subject Identifier (SUBJID) The variable SUBJID uniquely identifies each subject that participates in a study. If a single subject is screened and/or enrolled more than once in a study, then the subject’s SUBJID should be different for each unique screening or enrollment. For a study with multiple screenings and/or multiple enrollments per subject, SUBJID should be included in other related domains besides DM even though it may cause validation errors. It is recommended to include a table linking each SUBJID for a single subject to that subject’s USUBJID with any additional necessary explanation included in the relevant RG.” 4.1.1.3: “DM Domain (Demographics) Screen failures, when provided, should be included as a record in DM with the ARM, ARMCD, ACTARM, and ACTARMCD field left blank. For subjects who are randomized in treatment group but not treated, the planned arm variables (ARM and ARMCD) should be populated, but actual treatment arm variables (ACTARM and ACTARMCD) should be left blank.
| PHUSE Team Response: 11 March 2022 While the FDA's Study Data Technical Conformance Guide (v4.8.1, October 2021) sections 4.1.1.2 and 4.1.1.3 specifically mention how to store data related to subjects with multiple screenings or enrolments in the SDTM domains, sponsors may be varied in their approach to identifying records associated to each screening or enrolment attempt. The Multiple Subject Instances Team at CDISC is currently working on creating the new DC domain that will address this issue. For each USUBJID, this domain will contain multiple entries for each time that the subject screened or enrolled into the study. The SUBJID value will reflect the subject identifier for that time of participation. The Multiple Subject Instances Team at CDISC is also recommending the SUBJID to be included in the parent domain as a permissible variable, following the FDA Study Data Technical Conformance Guide section 4.1.1.2. The new DC domain and this recommendation will be included in the future release of SDTM IG v4.0. The SDTM ADaM Implementation FAQ provided the following examples of how some sponsors are capturing multiple subject instances using the current SDTM domains and variables. Your company may choose to represent this data differently. It is suggested that any concerns about the way the data is captured be discussed with the FDA prior to submission to ensure that it will not cause a denial at the time of submission. Some sponsors may set the value of SUBJID to be consistent within each domain, matching it to some portion of the value of USUBJID, and use the --REFID variables to map each screening or enrolment subject identifier. The use of --REFID for this purpose should be mentioned in the cSDRG. Instead of generating the custom domain to capture additional screening identifiers or enrolment identifiers (mentioned in section 4.1.1.3 of the FDA Study Data Technical Conformance Guide), sponsors may be capturing the additional identifiers in the supplemental qualifier of the DM domain (SUPPDM). The additional data stored in SUPPDM should be described in the cSDRG. Some sponsors have already added the SUBJID into the parent SDTM domains, following the FDA Technical Conformance Guide section 4.1.1.2. Some sponsors have defined the SVSTDTC and SVENDTC variables in the SV domain to be the start and end dates of the first attempts at screening or enrolment and last attempt of screening or enrolment, respectively. Similarly, the SESTDTC variable in the SE domain can been defined as the very first informed consent date. |
Does clinical trial data need to be submitted at the time of an IND submission? If it is needed, does a full data package need to be submitted, including all SDTM domains, ADaM datasets, reviewer’s guides, and Define files with the complete validation? | PHUSE Team Response: 11 March 2022 While clinical trial data and the complete data package are not required to be submitted at the time of IND, it would provide the FDA with an early look at the clinical data and the ADaM datasets that the sponsor is considering for the study. The data submitted may include any previous experience with the study drug in humans (often from foreign use). It is always good to discuss the need for submitting data at the time of the IND with the review division at the agency at the Pre-IND meeting. The review division can then decide if they would like to receive and review the data. |
At one point there was a joint CDISC/FDA team working on defining locations in SDTM/ADaM for BIMO components so that the CLINSITE information could be pulled from the submitted data instead of a separate dataset. This joint effort has currently been put on hold. However, at this point, the team recommends to continue to reference the current BIORESEARCH MONITORING TECHNICAL CONFORMANCE GUIDE. (https://www.fda.gov/regulatory-information/search-fda-guidance-documents/bioresearch-monitoring-technical-conformance-guide https://www.fda.gov/regulatory-information/search-fda-guidance-documents/bioresearch-monitoring-technical-conformance-guide). and Standardized Format for Electronic Submission of NDA and BLA Content for the Planning of Bioresearch Monitoring (BIMO) Inspections for CDER Submissions. | PHUSE Team Response: 13 August 2020 The team has provided their response to the question on "Requirements for Clinical Site Data and Subject Level Data Listings for FDA CDER's Inspection Process (also called BIMO submission or OSI Pre-NDA request)." in the past. |
These questions are primarily going out to the sub-team that worked on the Best Practices for Submission of Event Adjudication Data White Paper. The White Paper provided very useful tips on how to map adjudicated data to the new custom SDTM domain of EA. The following are the follow-up questions to this White Paper.
| PHUSE Team Response: 14 July 2020
|
Is exposure data from the parent study required to be in the SDTM data of a follow-up study (no treatment given in follow-up study)? Is it required for FDA and PMDA? Can the exposure data be carried over from the parent study SDTM into the follow-up study SDTM data, or does it need to be re-collected on the CRF of the follow-up study?. | PHUSE Team Response: 9 January 2020 In general, if the data is not collected on the CRF for the follow up study, then it is not recommended to report that into SDTM datasets. In this example, we recommend not to carry it over to SDTM for the follow up study. Instead, this information can be presented in the analysis dataset. |
Is there a Standard in the Industry of how they determine the study start date for clinical studies? Is it the protocol finalised date, first subject in date or first initiation date?. | PHUSE Team Response: 22 November 2018 As per the guidance from the FDA - Providing Regulatory Submission in Electronic Format - Standardised Study Data, 'the study start date for clinical studies is the earliest date of informed consent among any subject that enrolled in the study'. For example, see Study Start Date in the SDTM Trial Summary Domain (TSPARMCD = SSTDTC). For nonclinical studies, the study start date is the date on which the study protocol or plan is approved (signed) by the Study Director, also known as the study initiation date. For example, see Study Start Date in the SEND Trial Summary Domain (TSPARMCD = STSTDTC). This definition is consistent with the Study Data Standardised Plan (SDSP) PHUSE template, which is reviewed and authorised for usage. References FDA Guidance, "Providing Regulatory Submissions In Electronic Format —Standardised Study Data” https://www.fda.gov/downloads/Drugs/Guidances/UCM292334.pdf Study Data Standardised Plan PHUSE template https://phuse.global/Deliverables, direct link to the template |
What goes in the 'misc' folder with an m5 eCTD folder structure? For example, a lookup file containing SMQ assignment. The file is used during the creation of pooled ADaM to support an ISS. We want to provide this dataset to the reviewer. This does not contain subject’s data and it is not SDTM or ADaM. Should this go to the 'misc' folder? or to the analysis folder and described in the define.xml and classified as non-ADaM? or is it enough to describe its structure in the ADRG?. | PHUSE Team Response: 12 April 2018 Name, page 11-12. Information about these additional files and their use in creating the datasets should be included in the reviewer’s guides. |
Does the legacy data in non-CDISC format need to be converted to SDTM for all studies that are part of the FDA or PMDA submissions? If a sponsor has one pivotal study in non-CDISC and the other pivotal study in CDISC, do I need to convert both to CDISC format before submission?. | PHUSE Team Response: 7 June 2017
For other phase I and CP studies that don’t meet above criteria, electronic data required when PMDA needs them for their review. The study types will be those where standard PK analysis conducted, Population PK, and PBPK. Providing Regulatory Submissions in Electronic Format
FDA Non-binding documents and other resources can also be found in the FDA webpage for Study Data Standard Resources.
PMDA Non-Binding documents and other resources can also be found on the PMDA website for Advanced Review with Electronic Data Promotion Group. |
How do I make a test submission to the FDA?. | PHUSE Team Response: 7 June 2017 |
Will JumpStart (DataFit) services be available for Pharma clients before submission? What kind of checks are included in JumpStart?. | PHUSE Team Response: 7 June 2017 JumpStart as a Service is specific to FDA. There are multiple versions of open source validator tools available for use that are similar to the version of DataFit that FDA use. Use of a validator to check for compliance issues and inclusion of a Study Data Reviewer’s Guide will get a Sponsor close to all the information FDA looks at during the data fitness portion of a JumpStart service. The standard demographics analysis panels are available via the GitHub code repository. FDA will be sharing more scripts in the near future. Additional References: https://github.com/phuse-org/phuse-scripts |
When will CDISC (SDTM/ADaM) data standards be mandatory for data submission and how does this differ from for each regulatory agency?. | PHUSE Team Response: 12 September 2017
FDA Submission Type & Timing NDA, ANDA, and certain BLA submissions - Studies which start after 2016-12-17 (December 17th, 2016). Commercial INDs and amendments, except for submissions described in section 561 of the Federal Food, Drug, and Cosmetic Act - Studies which start after 2017-12-17 (December 17th, 2017). For the definition of "study start date," see the Providing Regulatory Submissions in Electronic Format - Standardised Study Data (PDF - 131 KB). Source for FDA: Japan (PMDA): Electronic data submission starts from 01-Oct-2016 with the transition period as noted below. PMDA Submission Type & Timing NDAs (eStudy Data submission criteria*) - Transition Period - Submission on or after 2016-10-01 (October 1st 2016) until 2020-03-31 (March 31st, 2020). All NDAs (eStudy Data submission criteria*) - Submission on or after 2020-04-01 (April 1st, 2020). During the transition period, PMDA accepts the legacy submission and partial data submission (hybrid submissions).
All required study data need to be submitted in CDISC format after April 1st, 2020.
*eStudy Data submission criteria: electronic data in CDISC format needed for studies meeting the following criteria.
Supported standard and versions data standard catalog and validation rules including rejection criteria are available together with applicable guidance on the PMDA Advance Review with Electronic Data promotion group website. Advice to European Medicines on Clinical trial data formats. (30APR2013)
China Food and Drug Administration (CDFA) has endorsed CDISC standards in their Clinical Trial Data Management Technology Guide* (July 2016): they mention "CDISC standards have seen more and more recognised and widely used in the industry, has become an international clinical trial data "common language". |