feat(webstatement): implement batch processing in job data handling

- Add a `CHUNK_SIZE` constant with a value of 1000 so that data is processed in batches, reducing memory usage.
- Introduce new batch attributes (`atmTransactionBatch`, `captureBatch`, `transferBatch`, and `tellerBatch`) that hold records temporarily before they are saved to the database.
- Replace direct per-record saves with appending records to a batch through the new `addToBatch()` function.
- Add a `saveBatch()` function that performs bulk saves using the `upsert` method.
- Ensure the batch is reset after saving, including on failure, so that failed records are not reprocessed.
- Add logging of the number of batches (chunks) processed so far, giving more insight into job progress.
- Validate the column count of every data row, log a warning when a row does not match, and count such rows in `errorCount`.
- Automatically add the timestamp attributes (`created_at` and `updated_at`) to each record before it is added to the batch, for time tracking.
- Make error logs more specific, for example by reporting the row in the processed file where the error occurred.
- Optimize data processing in the following jobs:
  - `ProcessAtmTransactionJob` for ATM transaction data.
  - `ProcessDataCaptureDataJob` for data capture data.
  - `ProcessFundsTransferDataJob` for funds transfer data.
  - `ProcessTellerDataJob` for teller data.
- Improve efficiency and scalability by inserting data in bulk, reducing database overhead, and avoiding row-by-row processing.
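
The batching flow described above can be sketched in plain PHP, without Laravel. This is only an illustration under stated assumptions: `BatchBuffer`, its `$sink` closure, and the `saved`/`errors` counters are hypothetical names that do not appear in the jobs, and the sink stands in for the bulk `upsert` call. Unlike the jobs, which increment `processedCount` when a row is queued, this sketch counts a row as saved only after its chunk has been written.

```php
<?php
declare(strict_types=1);

// Hypothetical stand-in for the jobs' batching logic (not the actual job code).
final class BatchBuffer
{
    private array $rows = [];
    public int $saved = 0;
    public int $errors = 0;

    // $sink receives one chunk of rows; in the jobs this role is played by upsert.
    public function __construct(private int $chunkSize, private \Closure $sink)
    {
    }

    public function add(array $header, array $row): void
    {
        // Reject rows whose column count does not match the header.
        if (count($header) !== count($row)) {
            $this->errors++;
            return;
        }

        $this->rows[] = array_combine($header, $row);

        // Flush as soon as a full chunk has accumulated.
        if (count($this->rows) >= $this->chunkSize) {
            $this->flush();
        }
    }

    public function flush(): void
    {
        if ($this->rows === []) {
            return;
        }

        // Reset the buffer before writing so a failed chunk is never re-sent.
        $pending = $this->rows;
        $this->rows = [];

        try {
            ($this->sink)($pending);
            $this->saved += count($pending);
        } catch (\Throwable $e) {
            $this->errors += count($pending);
        }
    }
}

// Usage: a chunk size of 2 and an in-memory sink that records each chunk.
$chunks = [];
$buffer = new BatchBuffer(2, function (array $rows) use (&$chunks): void {
    $chunks[] = $rows;
});

$header = ['id_teller', 'account_1'];
foreach ([['T1', 'A'], ['T2', 'B'], ['T3'], ['T4', 'C']] as $row) {
    $buffer->add($header, $row);
}
$buffer->flush(); // write the final partial chunk
```

Calling `flush()` once more after the loop mirrors the "process any remaining records" step in the jobs, since the last chunk is usually smaller than the chunk size.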
2025-05-28 09:29:18 +07:00
parent 30662b97d5
commit 0b607f86cb
4 changed files with 220 additions and 26 deletions
--- a/app/Jobs/ProcessTellerDataJob.php
+++ b/app/Jobs/ProcessTellerDataJob.php
@@ -20,6 +20,7 @@
        private const MAX_EXECUTION_TIME = 86400; // 24 hours in seconds
        private const FILENAME           = 'ST.TELLER.csv';
        private const DISK_NAME          = 'sftpStatement';
+        private const CHUNK_SIZE         = 1000; // Process data in chunks to reduce memory usage
        private const HEADER_MAP         = [
            'id'                => 'id_teller',
            'account_1'         => 'account_1',
@@ -129,6 +130,7 @@
        private string $period = '';
        private int    $processedCount = 0;
        private int    $errorCount     = 0;
+        private array  $tellerBatch    = [];

        /**
         * Create a new job instance.
@@ -166,6 +168,7 @@
            set_time_limit(self::MAX_EXECUTION_TIME);
            $this->processedCount = 0;
            $this->errorCount     = 0;
+            $this->tellerBatch    = [];
        }

        private function processPeriod()
@@ -222,9 +225,23 @@
            }

            $rowCount = 0;
+            $chunkCount = 0;
+
            while (($row = fgetcsv($handle, 0, self::CSV_DELIMITER)) !== false) {
                $rowCount++;
                $this->processRow($headerRow, $row, $rowCount, $filePath);
+
+                // Process in chunks to avoid memory issues
+                if (count($this->tellerBatch) >= self::CHUNK_SIZE) {
+                    $this->saveBatch();
+                    $chunkCount++;
+                    Log::info("Processed chunk $chunkCount ({$this->processedCount} records so far)");
+                }
+            }
+
+            // Process any remaining records
+            if (!empty($this->tellerBatch)) {
+                $this->saveBatch();
            }

            fclose($handle);
@@ -234,6 +251,14 @@
        private function processRow(array $headerRow, array $row, int $rowCount, string $filePath)
        : void
        {
+            // Skip rows whose column count does not match the header
+            if (count($headerRow) !== count($row)) {
+                Log::warning("Row $rowCount in $filePath has incorrect column count. Expected: " .
+                    count($headerRow) . ", Got: " . count($row));
+                $this->errorCount++;
+                return;
+            }
+
            // Combine the header row with the data row
            $rawData = array_combine($headerRow, $row);

@@ -249,10 +274,13 @@
            }

            try {
-                $teller = Teller::firstOrNew(['id_teller' => $data['id_teller']]);
-                $teller->fill($data);
-                $teller->save();
+                // Add timestamps
+                $now = now();
+                $data['created_at'] = $now;
+                $data['updated_at'] = $now;

+                // Add to batch for bulk processing
+                $this->tellerBatch[] = $data;
                $this->processedCount++;
            } catch (Exception $e) {
                Log::error("Error processing Teller at row $rowCount in $filePath: " . $e->getMessage());
@@ -260,6 +288,28 @@
            }
        }

+        private function saveBatch(): void
+        {
+            try {
+                if (!empty($this->tellerBatch)) {
+                    // Bulk insert/update teller records
+                    Teller::upsert(
+                        $this->tellerBatch,
+                        ['id_teller'], // Unique key
+                        array_diff(array_values(self::HEADER_MAP), ['id_teller']) // Update columns
+                    );
+
+                    // Reset batch after processing
+                    $this->tellerBatch = [];
+                }
+            } catch (Exception $e) {
+                Log::error("Error in saveBatch: " . $e->getMessage());
+                $this->errorCount += count($this->tellerBatch);
+                // Reset batch even if there's an error to prevent reprocessing the same failed records
+                $this->tellerBatch = [];
+            }
+        }
+
        private function logJobCompletion()
        : void
        {