Merge pull request #7463 from MicrosoftDocs/JasonHQX-patch-7

Mattp123 · web-flow · commit 0792be7cc4d7 · 2023-01-05T15:37:07.000-08:00
Update export-data-lake-faq.yml
diff --git a/powerapps-docs/maker/data-platform/export-data-lake-faq.yml b/powerapps-docs/maker/data-platform/export-data-lake-faq.yml
@@ -52,10 +52,16 @@ sections:
    - question: When should I use a yearly or monthly partition strategy?
      answer: |
         For Dataverse tables where data volume is high within a year, we recommend you use monthly partitions. Doing so results in smaller files and better performance. Additionally, if the rows in Dataverse tables are updated frequently, splitting into multiple smaller files help improve performance in the case of in-place update scenarios.
-   - question: When do I use Append only mode for a historical view of changes?
+   - question: What is append only mode and what is the difference between append only and in-place update mode?
+     answer: |
+        In append only mode, incremental data from Dataverse tables are appended to the corresponding file partition in the lake. For more information: [Advanced Configuration Options in Azure Synapse Link](azure-synapse-link-advanced-configuration.md)
+   - question: When do I use append only mode for a historical view of changes?
      answer: |
          Append only mode is the recommended option for writing Dataverse table data to the lake, especially when the data volumes are high within a partition with frequently changing data. Again, this is a commonly used and highly recommended option for enterprise customers. Additionally, you can choose to use this mode for scenarios where the intent is to incrementally review changes from Dataverse and process the changes for ETL, AI, and ML scenarios. Append only mode provides a history of changes, instead of the latest change or in place update, and enables several time series from AI scenarios, such as prediction or forecasting analytics based on historical values. 
-   - question: Why do I see duplicated version numbers when I export data using Append only mode?   
+   - question: How do I retrieve the most up-to-date row of each record and exclude deleted rows when I export data in append only mode?
+     answer: |
+        In append only mode, you should identify the latest version of record with the same ID using **VersionNumber** and **SinkModifiedOn** then apply **isDeleted=0** on the latest version.
+   - question: Why do I see duplicated version numbers when I export data using append only mode?   
      answer: |
          For append only mode, if Azure Synapse Link for Dataverse does not get an acknowledgement from the Azure data lake that the data has been committed due to any reason such as network delays, Synapse Link will retry in those scenarios and commit the data again. The downstream consumption should be made resilient to this scenario by filtering data using SinkModifiedOn.     
    - question: Which Dataverse tables are not supported for export?
@@ -67,9 +73,9 @@ sections:
    - question: Does Azure Synapse Link support calculated columns?
      answer: |
         Yes. In Dataverse, the calculated column keeps only the formula information and the real value depends on the base entity column so the calculated column will be updated if and only if all data (base tables) related to calculated columns are exported via Azure Synapse Link.
-   - question: Which Dataverse tables use Append only by default?
+   - question: Which Dataverse tables use append only mode by default?
      answer: |
-        All tables that do not have a createdOn field will be synced using Append only mode by default. This includes relationship tables as well as the ActivityParty table.
+        All tables that do not have a createdOn field will be synced using append only mode by default. This includes relationship tables as well as the ActivityParty table.
    - question: Why does Azure Synapse Link for Dataverse require all resources to be in the same region and what can I do about it?
      answer: |
         To ensure high performance and low latency in addition to preventing egress charges, Synapse Link requires all resources to be located in the same region. If you have a cross-region scenario, you can: