Nokia OSS-Based 5G KPI Troubleshooting

🔧 Field & LAB Proven  |  🧠 Interview-Ready  |  📊 OSS-Driven

Scenario 1: Sudden RRC Setup Success Rate Degradation

OSS Symptoms & Alarms:

PM Counters:
NRRC.ConnEstabSucc.Sum drops 30% in 2 hours

Example (Hourly):

  • 06:00–07:00 β†’ 12,500

  • 07:00–08:00 β†’ 12,100

  • 08:00–09:00 β†’ 8,400

  • 09:00–10:00 β†’ 8,200


Alarms:
N/A (no hardware alarms)


Correlation:
NNGAP.InitCtxtSetupFail.Sum increases, with cause
“radioNetwork-resource-not-available”

Example:

  • Normal hour β†’ 220

  • Degraded hour β†’ 1,150


Hourly Trend:
Degradation starts daily at 08:00


Expert Troubleshooting Steps:


Step 1: Root Cause Analysis via PM Counters

Check RRC failure breakdown

Counters analyzed (hourly):

  • NRRC.ConnEstabFail.Sum

    • Normal hour β†’ 1,050

    • Degraded hour β†’ 4,200

  • NRRC.ConnFail_Congestion.Sum

    • Normal hour β†’ 320

    • Degraded hour β†’ 3,150

  • NRRC.ConnFail_Radio.Sum

    • Normal hour β†’ 410

    • Degraded hour β†’ 450

  • NRRC.ConnFail_Terminal.Sum

    • Normal hour β†’ 320

    • Degraded hour β†’ 350

  • NPRACH.SuccTotal

    • Normal hour β†’ 9,800

    • Degraded hour β†’ 9,750

Observation:

  • Majority of RRC failures are from NRRC.ConnFail_Congestion.Sum

  • NRRC.ConnFail_Radio.Sum remains stable

  • NPRACH.SuccTotal remains stable

Conclusion:
RRC degradation is not due to radio conditions or PRACH failures.
Root cause points to control-plane congestion.
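The breakdown above can be automated when counters are exported from the OSS. A minimal sketch, assuming the hourly counters arrive as a plain dict keyed by the PM counter names listed above; the helper itself is illustrative, not a Nokia OSS API:

```python
# Illustrative sketch: find the dominant RRC setup failure cause from an
# hourly PM counter export. Counter names match the ones analyzed above;
# the dict layout and helper are assumptions, not a Nokia interface.

def dominant_failure_cause(counters):
    """Return (failure sub-counter with the largest share, share 0..1)."""
    total = counters["NRRC.ConnEstabFail.Sum"]
    causes = {k: v for k, v in counters.items()
              if k.startswith("NRRC.ConnFail_")}
    top = max(causes, key=causes.get)
    return top, causes[top] / total

# Degraded-hour values from the analysis above
degraded_hour = {
    "NRRC.ConnEstabFail.Sum": 4200,
    "NRRC.ConnFail_Congestion.Sum": 3150,
    "NRRC.ConnFail_Radio.Sum": 450,
    "NRRC.ConnFail_Terminal.Sum": 350,
}
cause, share = dominant_failure_cause(degraded_hour)  # congestion, 0.75
```

Here congestion carries 75% of failures, matching the conclusion above.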


Step 2: Resource Saturation Analysis

Analyze control plane resource utilization

Counters analyzed during 08:00–10:00:

  • NCCE.UtilDL.P95

    • Normal hour β†’ 68%

    • Degraded hour β†’ 92%

  • NPRACH.AttTotal

    • Normal hour β†’ 10,400

    • Degraded hour β†’ 14,800

  • NRRC.ConnRej.Sum

    • Normal hour β†’ 480

    • Degraded hour β†’ 2,900

  • NGAP.UECtxtRelReq.Sum

    • Normal hour β†’ 310

    • Degraded hour β†’ 1,780

Observation:

  • PDCCH CCE utilization crosses 90%

  • RRC rejections rise sharply

  • Core network context releases increase

Conclusion:
Control-plane resource saturation is confirmed during busy hours.
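The saturation check applied here reduces to a simple rule. A sketch with thresholds taken from the judgment above (90% CCE utilization, rejections at 3× the baseline); the function name and exact factors are illustrative assumptions:

```python
def control_plane_saturated(cce_util_p95, rrc_rej, rrc_rej_baseline,
                            util_threshold=0.90, rej_factor=3.0):
    """Flag control-plane saturation when PDCCH CCE P95 utilization
    crosses the threshold AND RRC rejections exceed rej_factor x baseline."""
    return (cce_util_p95 >= util_threshold
            and rrc_rej >= rej_factor * rrc_rej_baseline)

# Degraded hour (92% CCE, 2,900 rejects vs. 480 baseline) trips the rule;
# the normal hour (68% CCE) does not.
```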


Step 3: Parameter Configuration Audit

Check current mobility and access parameters

Parameters audited:

  • acBarringFactor

    • Current value β†’ 0.95

  • rrcConnectionRejectWaitTimer

    • Current value β†’ 1000 ms

  • maxConnectedUsers

    • Current value β†’ 200

  • prachConfigurationIndex

    • Current value β†’ 98

Audit window:
Last 7 days

Observation:

  • No parameter change detected

  • Values remained constant before degradation

Conclusion:
Issue is traffic-driven, not configuration-driven.


Parameter Optimization Strategy:

| Parameter | Pre-Optimization | Post-Optimization | Rationale |
| --- | --- | --- | --- |
| acBarringFactor | 0.95 | 0.65 | Reduce access attempts during congestion |
| rrcConnectionRejectWaitTimer | 1000 ms | 500 ms | Faster retry for rejected UEs |
| maxConnectedUsers | 200 | 180 | Protect existing connections |
| prachFreqOffset | 0 | 12 | Spread RACH attempts across resources |
| ssbPerRACHOccasion | 8 | 16 | Better beam correspondence for initial access |

Pre vs Post Optimization Impact:

| KPI | Pre-Optimization | Post-Optimization | Δ | Trend |
| --- | --- | --- | --- | --- |
| RRC Setup Success Rate | 85.2% | 96.8% | +11.6% | Improved |
| RRC Reject Rate | 12.5% | 2.3% | −10.2% | Reduced |
| PDCCH CCE Utilization (P95) | 92% | 78% | −14% | Reduced |
| Average RRC Setup Time | 128 ms | 89 ms | −39 ms | Reduced |
| Initial Context Setup Failures | 8.2% | 1.1% | −7.1% | Reduced |

Final Technical Conclusion

The sudden RRC Setup Success Rate degradation is caused by control-plane congestion during predictable busy hours.
By optimizing access control, retry timing, and RACH distribution, signaling load is stabilized and RRC performance is restored without hardware expansion.

Scenario 2: High RLF Rate in Specific Beam Directions

OSS Symptoms & Alarms:

PM Counters:
NRLF.Detected.Sum spikes in beams 2, 5, 8

Example (Hourly):

  • Normal beams β†’ 120–180
  • Beam 2 β†’ 980
  • Beam 5 β†’ 1,120
  • Beam 8 β†’ 1,050

Alarms:
No RF alarms, but beam-specific failures observed

Correlation:
High NBFI.Count.Sum in same beams

Example:

  • Normal beams β†’ 90–140
  • Beam 2 β†’ 860
  • Beam 5 β†’ 940
  • Beam 8 β†’ 910

Pattern:
Occurs during specific hours 18:00–22:00


Expert Troubleshooting Steps:


Step 1: Beam Failure Pattern Analysis

Analyze beam failure patterns using the following counters:

  • NBFI.Count.Sum
  • NRLF.Detected.Sum
  • NRSRP.Beam.Avg
  • NRSRQ.Beam.Avg

Example observations (18:00–22:00):

Beam 2

  • NBFI.Count.Sum β†’ 860
  • NRLF.Detected.Sum β†’ 980
  • NRSRP.Beam.Avg β†’ βˆ’96 dBm
  • NRSRQ.Beam.Avg β†’ βˆ’15 dB

Beam 5

  • NBFI.Count.Sum β†’ 940
  • NRLF.Detected.Sum β†’ 1,120
  • NRSRP.Beam.Avg β†’ βˆ’97 dBm
  • NRSRQ.Beam.Avg β†’ βˆ’16 dB

Beam 8

  • NBFI.Count.Sum β†’ 910
  • NRLF.Detected.Sum β†’ 1,050
  • NRSRP.Beam.Avg β†’ βˆ’95 dBm
  • NRSRQ.Beam.Avg β†’ βˆ’15 dB

Observation:

  • RLF and beam failure instances spike only in specific beams
  • RSRP and RSRQ remain within acceptable range
  • Issue is beam-specific, not cell-wide

Conclusion:
High RLF is not caused by coverage loss but by beam instability.
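Beam-level outliers like beams 2, 5, and 8 can be flagged automatically by comparing each beam's RLF count against the cell median. A sketch assuming per-beam counts are available as a dict; the 3×-median factor is an illustrative choice:

```python
from statistics import median

def rlf_outlier_beams(rlf_by_beam, factor=3.0):
    """Beams whose RLF count exceeds factor x the median across all beams."""
    med = median(rlf_by_beam.values())
    return sorted(b for b, v in rlf_by_beam.items() if v > factor * med)

# Per-beam NRLF.Detected.Sum from the observations above (normal beams
# filled in with values in the reported 120-180 range).
counts = {0: 150, 1: 130, 2: 980, 3: 160, 4: 120,
          5: 1120, 6: 140, 7: 170, 8: 1050}
```

Applied to these counts, the rule isolates exactly beams 2, 5, and 8.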


Step 2: Beam Management Configuration Audit

Check beam management parameters for affected beams:

Parameters audited:

  • beamFailureRecoveryTimer
  • beamFailureInstanceMaxCount
  • beamReportingPeriodicity
  • ssbPeriodicity
  • csiRsDensity

Observed configuration (Pre-Optimization):

  • beamFailureRecoveryTimer β†’ 100 ms
  • beamFailureInstanceMaxCount β†’ 5
  • beamReportingPeriodicity β†’ 160 ms
  • ssbPeriodicity β†’ 20 ms
  • csiRsDensity β†’ one

Observation:

  • Recovery timer too high for fast beam dynamics
  • Beam reporting periodicity too slow
  • CSI-RS density insufficient for accurate beam tracking

Conclusion:
Beam management configuration is not optimized for high-mobility or interference-prone scenarios.


Step 3: Inter-beam Interference Analysis

Analyze inter-beam interference using:

  • serving_beam
  • interfering_beam
  • interference_events
  • interference_level_db

Example observations:

  • Serving beam 2 interfered by beam 7
    • interference_events β†’ 420
    • avg_interference β†’ βˆ’6 dB
  • Serving beam 5 interfered by beam 9
    • interference_events β†’ 510
    • avg_interference β†’ βˆ’5 dB
  • Serving beam 8 interfered by beam 3
    • interference_events β†’ 470
    • avg_interference β†’ βˆ’6 dB

Conclusion:
Significant inter-beam interference exists, leading to frequent beam failures and RLF.


Parameter Optimization Strategy:

| Parameter | Pre-Optimization | Post-Optimization | Rationale |
| --- | --- | --- | --- |
| beamFailureRecoveryTimer | 100 ms | 50 ms | Faster beam recovery |
| beamFailureInstanceMaxCount | 5 | 3 | More sensitive beam failure detection |
| beamReportingPeriodicity | 160 ms | 80 ms | Faster beam reporting |
| ssbPeriodicity | 20 ms | 10 ms | More frequent beam sweeping |
| csiRsDensity | one | three | Denser CSI-RS for better beam management |

Pre vs Post Optimization Impact:

| KPI | Pre-Optimization | Post-Optimization | Δ | Trend |
| --- | --- | --- | --- | --- |
| Beam Failure Rate | 15.2% | 3.8% | −11.4% | Reduced |
| RLF due to Beam Failure | 8.5% | 1.2% | −7.3% | Reduced |
| Beam Switch Delay | 45 ms | 22 ms | −23 ms | Reduced |
| Beam Measurement Accuracy | 78% | 92% | +14% | Improved |
| User Throughput (affected beams) | 65 Mbps | 142 Mbps | +77 Mbps | Improved |

Final Technical Conclusion

The high RLF rate was caused by beam instability combined with inter-beam interference during peak hours.
By optimizing beam recovery timing, reporting periodicity, CSI-RS density, and sweeping frequency, beam robustness improved significantly, resulting in reduced RLF and enhanced user throughput.

Scenario 3: UL Throughput Degradation During Peak Hours

OSS Symptoms & Alarms:

PM Counters:
NTHP.UlMacCellVol drops 40% during 18:00–21:00

Example (Hourly UL Throughput):

  • 16:00–17:00 β†’ 330 Mbps
  • 17:00–18:00 β†’ 310 Mbps
  • 18:00–19:00 β†’ 190 Mbps
  • 19:00–20:00 β†’ 180 Mbps
  • 20:00–21:00 β†’ 195 Mbps

Alarms:
No hardware alarms


Correlation:
NULInterference.Avg rises while NPUSCH.PowerHeadroom.Avg turns negative

Example:

  • NULInterference.Avg
    • Normal hour β†’ βˆ’98 dBm
    • Peak hour β†’ βˆ’92 dBm
  • NPUSCH.PowerHeadroom.Avg
    • Normal hour β†’ 4.2 dB
    • Peak hour β†’ βˆ’2.5 dB

Pattern:
Coincides with UL interference increase during peak hours


Expert Troubleshooting Steps:


Step 1: UL Interference Analysis

Analyze UL interference patterns using the following counters:

  • NULInterference.Avg
  • NULInterference.Max
  • NTHP.UlMacCellVol
  • NPUSCH.TxPower.Avg
  • NBLER.UL.Avg

Example observations (hourly):

18:00–19:00

  • NULInterference.Avg β†’ βˆ’92 dBm
  • NULInterference.Max (P95) β†’ βˆ’88 dBm
  • NTHP.UlMacCellVol β†’ 190 Mbps
  • NPUSCH.TxPower.Avg β†’ 22.8 dBm
  • NBLER.UL.Avg β†’ 15.6%

19:00–20:00

  • NULInterference.Avg β†’ βˆ’91 dBm
  • NULInterference.Max (P95) β†’ βˆ’87 dBm
  • NTHP.UlMacCellVol β†’ 180 Mbps
  • NPUSCH.TxPower.Avg β†’ 23.1 dBm
  • NBLER.UL.Avg β†’ 16.2%

Observation:

  • UL interference increases significantly during peak hours
  • UL throughput drops sharply in the same period
  • UE transmit power approaches maximum
  • UL BLER increases beyond acceptable threshold

Conclusion:
UL throughput degradation is driven by high uplink interference.
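The three symptoms above (noise rise, negative power headroom, elevated BLER) can be combined into one interference-limited test. A sketch with illustrative thresholds derived from the figures above (−98 dBm quiet-hour floor, 5 dB rise, 10% BLER); the helper is an assumption, not an OSS feature:

```python
def ul_interference_limited(noise_dbm, phr_db, bler,
                            noise_floor_dbm=-98.0, rise_db=5.0,
                            bler_max=0.10):
    """True when the noise rise over the quiet-hour floor, a negative
    power headroom, and UL BLER together point to interference-limited UL."""
    noise_rise = noise_dbm - noise_floor_dbm
    return noise_rise >= rise_db and phr_db < 0 and bler > bler_max

# Peak hour: -92 dBm noise, -2.5 dB PHR, 15.6% BLER -> limited.
# Normal hour: -98 dBm, +4.2 dB PHR, ~5% BLER -> not limited.
```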


Step 2: Power Control Parameter Audit

Check UL power control configuration parameters:

Parameters audited:

  • p0NominalPUSCH
  • alpha
  • deltaMCS-Enabled
  • ulTargetBLER
  • srsPeriodicity
  • bsrTimer

Observed configuration (Pre-Optimization):

  • p0NominalPUSCH β†’ βˆ’76 dBm
  • alpha β†’ 0.8
  • deltaMCS-Enabled β†’ FALSE
  • ulTargetBLER β†’ 10%
  • srsPeriodicity β†’ 20 ms
  • bsrTimer β†’ 20 ms

Observation:

  • Target PUSCH power too low for interference conditions
  • Partial path loss compensation applied
  • MCS-based power adjustment disabled
  • UL BLER target too relaxed

Conclusion:
UL power control configuration is not optimized for high-interference peak hours.


Step 3: Scheduling Analysis for UL

Analyze UL scheduler behavior using:

  • NPRB.UtilUL.Avg
  • NPUSCH.Scheduled.UEs
  • NBSR.Received.Sum
  • NUL.Sched.Delay.Avg

Example observations (18:00–21:00):

  • NPRB.UtilUL.Avg β†’ 88%
  • NPUSCH.Scheduled.UEs β†’ 64
  • NBSR.Received.Sum β†’ 18,500
  • NUL.Sched.Delay.Avg β†’ 14.2 ms

Observation:

  • High UL PRB utilization
  • Increased scheduling delay
  • Large number of buffer status reports indicating backlog

Conclusion:
UL scheduler is stressed due to interference-driven retransmissions and power limitations.


Parameter Optimization Strategy:

| Parameter | Pre-Optimization | Post-Optimization | Rationale |
| --- | --- | --- | --- |
| p0NominalPUSCH | −76 dBm | −70 dBm | Increase target power to overcome interference |
| alpha | 0.8 | 1.0 | Full path loss compensation |
| deltaMCS-Enabled | FALSE | TRUE | Enable MCS-based power adjustment |
| ulTargetBLER | 10% | 5% | More conservative MCS selection to cut retransmissions |
| srsPeriodicity | 20 ms | 40 ms | Reduce SRS overhead for more PUSCH |
| bsrTimer | 20 ms | 10 ms | Faster BSR reporting |

Pre vs Post Optimization Impact:

| KPI | Pre-Optimization | Post-Optimization | Δ | Trend |
| --- | --- | --- | --- | --- |
| UL Throughput (Peak Hour) | 125 Mbps | 320 Mbps | +195 Mbps | Improved |
| UL PRB Utilization | 88% | 75% | −13% | Reduced |
| UL BLER | 15.2% | 6.8% | −8.4% | Reduced |
| PUSCH Tx Power Headroom | −2.5 dB | 3.8 dB | +6.3 dB | Improved |
| UL Interference | −92 dBm | −98 dBm | −6 dB | Improved |

Final Technical Conclusion

The UL throughput degradation was driven by elevated uplink interference combined with conservative power control during peak hours.
After raising the PUSCH power target, enabling full path-loss compensation and MCS-based power adjustment, and tuning SRS and BSR timing, UL throughput recovered while BLER and interference both fell.

Scenario 4: Intra-Frequency Handover Failure Increase

OSS Symptoms & Alarms:

PM Counters:
Intra-frequency HO failure rate (from NHO.FailIntraFreq.Sum vs. HO attempts) increases from 2% to 12%

Example (Daily Average):

  • Normal period β†’ 2.1%
  • Degraded period β†’ 11.8%

Alarms:
No neighbor relation alarms


Correlation:
Failures concentrated in specific neighbor pairs

Example:

  • CELL_A β†’ CELL_B β†’ 420 failures
  • CELL_C β†’ CELL_D β†’ 390 failures
  • Other pairs β†’ < 50 failures

Pattern:
Affects cells with overlapping coverage areas


Expert Troubleshooting Steps:


Step 1: HO Failure Analysis by Neighbor Pair

Analyze HO failures between specific cell pairs using:

  • NHO.FailIntraFreq.Sum
  • ho_success
  • failure_cause = too-late
  • failure_cause = too-early
  • failure_cause = wrong-cell
  • rsrp_source
  • rsrp_target

Example observations (Top failing pairs):

CELL_A β†’ CELL_B

  • total_ho_attempts β†’ 3,200
  • successful_ho β†’ 2,760
  • too-late β†’ 290
  • too-early β†’ 95
  • wrong-cell β†’ 55
  • avg_source_rsrp β†’ βˆ’103 dBm
  • avg_target_rsrp β†’ βˆ’110 dBm

CELL_C β†’ CELL_D

  • total_ho_attempts β†’ 2,850
  • successful_ho β†’ 2,420
  • too-late β†’ 260
  • too-early β†’ 110
  • wrong-cell β†’ 60
  • avg_source_rsrp β†’ βˆ’104 dBm
  • avg_target_rsrp β†’ βˆ’111 dBm

Observation:

  • Majority of failures are too-late
  • Target cell RSRP is significantly weaker at HO execution
  • Failures are localized to overlapping coverage regions

Conclusion:
HO triggering occurs too late in overlapping coverage scenarios.
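The per-pair breakdown above reduces to a failure rate plus a dominant-cause pick. A sketch using the CELL_A → CELL_B figures; the helper name is hypothetical:

```python
def ho_failure_profile(attempts, success, too_late, too_early, wrong_cell):
    """Return (failure rate, dominant failure cause) for a neighbor pair."""
    causes = {"too-late": too_late, "too-early": too_early,
              "wrong-cell": wrong_cell}
    return (attempts - success) / attempts, max(causes, key=causes.get)

# CELL_A -> CELL_B figures from above: 3,200 attempts, 2,760 successes.
rate, cause = ho_failure_profile(3200, 2760, 290, 95, 55)
```

The 13.75% failure rate with "too-late" dominant matches the observation above.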


Step 2: Mobility Parameter Configuration Audit

Compare mobility parameters between problematic cells:

Parameters analyzed:

  • a3Offset
  • hysteresis
  • timeToTrigger
  • cellIndividualOffset
  • qOffsetCell

Example comparison (CELL_A vs CELL_B):

  • a3Offset β†’ CELL_A: dB3, CELL_B: dB2, difference: 1 dB
  • hysteresis β†’ CELL_A: dB2, CELL_B: dB1, difference: 1 dB
  • timeToTrigger β†’ CELL_A: 480 ms, CELL_B: 320 ms, difference: 160 ms
  • cellIndividualOffset β†’ CELL_A: 0 dB, CELL_B: +3 dB, difference: 3 dB
  • qOffsetCell β†’ CELL_A: 0 dB, CELL_B: 0 dB, difference: 0 dB

Observation:

  • HO triggering thresholds are misaligned between neighbor cells
  • Longer timeToTrigger delays HO execution
  • No bias applied toward the stronger target cell

Conclusion:
Mobility parameter mismatch is contributing to late HO execution.
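A parameter-mismatch audit like this one is essentially a dict diff between the two cells. A sketch using the CELL_A/CELL_B values above, with values kept as plain strings for illustration (real exports would carry typed values):

```python
def mobility_mismatches(cell_a, cell_b):
    """Parameters whose configured values differ between two neighbor cells."""
    return {p: (cell_a[p], cell_b[p])
            for p in cell_a if cell_a[p] != cell_b.get(p)}

cell_a = {"a3Offset": "dB3", "hysteresis": "dB2", "timeToTrigger": "480ms",
          "cellIndividualOffset": "0dB", "qOffsetCell": "0dB"}
cell_b = {"a3Offset": "dB2", "hysteresis": "dB1", "timeToTrigger": "320ms",
          "cellIndividualOffset": "+3dB", "qOffsetCell": "0dB"}
diff = mobility_mismatches(cell_a, cell_b)  # 4 mismatched parameters
```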


Step 3: Measurement Reporting Analysis

Analyze measurement report quality using:

  • mr_quality
  • mr_delay
  • ue_id

Example observations (last 6 hours):

CELL_A

  • median_mr_quality β†’ 62
  • avg_reporting_delay β†’ 145 ms
  • unique_reporting_ues β†’ 380

CELL_B

  • median_mr_quality β†’ 58
  • avg_reporting_delay β†’ 162 ms
  • unique_reporting_ues β†’ 360

Observation:

  • Measurement quality degrades near HO region
  • Reporting delay increases during mobility
  • Fewer UEs report timely measurements

Conclusion:
Delayed and filtered measurements worsen late HO behavior.


Parameter Optimization Strategy:

| Parameter | Pre-Optimization | Post-Optimization | Rationale |
| --- | --- | --- | --- |
| a3Offset | dB3 | dB2 | Earlier handover trigger |
| hysteresis | dB2 | dB1 | Lower hysteresis for earlier HO triggering |
| timeToTriggerA3 | 480 ms | 320 ms | Faster reaction to changing conditions |
| cellIndividualOffset | 0 dB | +3 dB (for target) | Boost target cell attractiveness |
| filterCoefficientRSRP | fc4 | fc2 | Faster RSRP filtering |
| reportAmountA3 | infinity | 4 | Limit excessive reporting |

Pre vs Post Optimization Impact:

| KPI | Pre-Optimization | Post-Optimization | Δ | Trend |
| --- | --- | --- | --- | --- |
| Intra-Freq HO Success Rate | 87.5% | 98.2% | +10.7% | Improved |
| HO Failure (Too Late) | 6.2% | 0.8% | −5.4% | Reduced |
| HO Failure (Too Early) | 3.1% | 0.5% | −2.6% | Reduced |
| Ping-Pong HOs | 8.5% | 2.1% | −6.4% | Reduced |
| Average HO RSRP | −112 dBm | −105 dBm | +7 dB | Improved |

Final Technical Conclusion

The intra-frequency HO failure increase was caused by late HO triggering due to mobility parameter mismatch and delayed measurement reporting in overlapping coverage areas.
After aligning A3 thresholds, reducing filtering, and optimizing reporting behavior, HO performance improved significantly with reduced failures and ping-pong events.

Scenario 5: PDU Session Establishment Failures for URLLC Slice

OSS Symptoms & Alarms:

PM Counters:
NPDU.SessEstabFail.Sum for SNSSAI 010203 increases

Example (Hourly):

  • Normal hour β†’ 120
  • Degraded hour β†’ 1,450

Alarms:
Slice resource utilization alarms observed


Correlation:
Failures occur when NSlice.RB.Util.SNSSAI_010203 > 80%

Example:

  • NSlice.RB.Util.SNSSAI_010203 (normal) β†’ 65%
  • NSlice.RB.Util.SNSSAI_010203 (peak) β†’ 92%

Pattern:
Affects only URLLC slice (SNSSAI 010203)
eMBB slice (SNSSAI 010101) remains unaffected


Expert Troubleshooting Steps:


Step 1: Slice Resource Analysis

Analyze slice resource utilization and failures using the following counters:

  • NSlice.RB.Util.SNSSAI_010203
  • NSlice.UE.Count.SNSSAI_010203
  • NPDU.SessEstabAtt.SNSSAI_010203
  • NPDU.SessEstabFail.SNSSAI_010203
  • NPDU.SessEstabAtt.SNSSAI_010101
  • NPDU.SessEstabFail.SNSSAI_010101

Example observations (18:00–21:00):

URLLC Slice – SNSSAI 010203

  • NSlice.RB.Util.SNSSAI_010203 β†’ 92%
  • NSlice.UE.Count.SNSSAI_010203 β†’ 86
  • NPDU.SessEstabAtt.SNSSAI_010203 β†’ 2,350
  • NPDU.SessEstabFail.SNSSAI_010203 β†’ 1,450

eMBB Slice – SNSSAI 010101

  • NSlice.RB.Util.SNSSAI_010101 β†’ 58%
  • NPDU.SessEstabAtt.SNSSAI_010101 β†’ 3,200
  • NPDU.SessEstabFail.SNSSAI_010101 β†’ 95

Observation:

  • High PDU session failures observed only for URLLC slice
  • eMBB slice shows normal behavior
  • Failures strongly correlate with URLLC RB utilization crossing 80%

Conclusion:
PDU session failures are caused by URLLC slice resource exhaustion.
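The correlation used here (failures appear once slice RB utilization crosses 80%) can be written as a per-slice rule. A sketch with the 80% utilization and 5% failure-rate thresholds as illustrative values:

```python
def slice_overloaded(rb_util, attempts, failures,
                     util_threshold=0.80, fail_rate_max=0.05):
    """Flag a slice whose RB utilization exceeds the threshold while the
    PDU session failure rate is above fail_rate_max."""
    return rb_util > util_threshold and failures / attempts > fail_rate_max

# URLLC (92% util, 1,450/2,350 failed) trips the rule; eMBB does not.
```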


Step 2: QoS Policy Configuration Audit

Check URLLC slice QoS configuration using:

  • param_name
  • param_value
  • expected_value
  • compliance_status

Example audit results (SNSSAI 010203):

  • guaranteedFlowBitRateUL
    • configured β†’ 10 Mbps
    • expected β†’ 50 Mbps
    • compliance_status β†’ NON_COMPLIANT
  • packetDelayBudget
    • configured β†’ 20 ms
    • expected β†’ 10 ms
    • compliance_status β†’ NON_COMPLIANT
  • preemptionCapability
    • configured β†’ may-not-preempt
    • expected β†’ may-preempt
    • compliance_status β†’ NON_COMPLIANT
  • preemptionVulnerability
    • configured β†’ preemptable
    • expected β†’ not-preemptable
    • compliance_status β†’ NON_COMPLIANT

Observation:

  • URLLC QoS policies are not aligned with strict latency and priority requirements
  • URLLC traffic cannot preempt lower-priority traffic

Conclusion:
QoS misalignment contributes to session establishment failures under load.


Step 3: Admission Control Analysis

Analyze admission control decisions for URLLC slice using:

  • requested_snssai
  • requested_5qi
  • decision
  • rejection_reason
  • available_resources
  • required_resources

Example observations (last 2 hours):

  • UE-A
    • requested_snssai β†’ 010203
    • requested_5qi β†’ 6
    • decision β†’ REJECT
    • rejection_reason β†’ INSUFFICIENT_RB
    • available_resources β†’ 18 RBs
    • required_resources β†’ 30 RBs
  • UE-B
    • requested_snssai β†’ 010203
    • requested_5qi β†’ 6
    • decision β†’ REJECT
    • rejection_reason β†’ SLICE_CAPACITY_LIMIT
    • available_resources β†’ 15 RBs
    • required_resources β†’ 28 RBs

Observation:

  • URLLC session requests rejected due to lack of guaranteed resources
  • Admission control blocks URLLC when slice utilization is high

Conclusion:
Admission control thresholds are too restrictive for URLLC traffic.
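The two rejection reasons observed can be reproduced with a toy admission decision. The ordering (RB check first, then slice-capacity check) and the 80% cap are assumptions for illustration, not the actual gNB algorithm:

```python
def admit(required_rbs, available_rbs, slice_util, slice_cap=0.80):
    """Simplified slice admission decision returning (decision, reason)."""
    if required_rbs > available_rbs:
        return "REJECT", "INSUFFICIENT_RB"
    if slice_util >= slice_cap:
        return "REJECT", "SLICE_CAPACITY_LIMIT"
    return "ADMIT", None

# UE-A above: needs 30 RBs with 18 available -> INSUFFICIENT_RB.
# A request that fits into a moderately loaded slice is admitted.
```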


Parameter Optimization Strategy:

| Parameter | Pre-Optimization | Post-Optimization | Rationale |
| --- | --- | --- | --- |
| sliceMaxRBPercentage | 20% | 30% | Increase resource allocation for URLLC |
| guaranteedFlowBitRateUL | 10 Mbps | 50 Mbps | Increase guaranteed rate for URLLC |
| packetDelayBudget | 20 ms | 10 ms | Tighter delay budget for URLLC |
| preemptionCapability | may-not-preempt | may-preempt | Allow URLLC to preempt eMBB |
| preemptionVulnerability | preemptable | not-preemptable | Protect URLLC from preemption |
| 5qi6MaxRetxThreshold | 4 | 2 | Fewer retransmissions for lower latency |

Pre vs Post Optimization Impact:

| KPI | Pre-Optimization | Post-Optimization | Δ | Trend |
| --- | --- | --- | --- | --- |
| URLLC PDU Session Success Rate | 71.5% | 99.2% | +27.7% | Improved |
| URLLC Slice RB Utilization | 92% | 75% | −17% | Reduced |
| URLLC Latency (5QI 6) | 28 ms | 12 ms | −16 ms | Reduced |
| URLLC Packet Loss Rate | 1.8% | 0.1% | −1.7% | Reduced |
| eMBB Impact (Throughput) | 0% | −8% | −8% | Acceptable |

Final Technical Conclusion

The PDU session establishment failures were caused by URLLC slice resource exhaustion combined with misaligned QoS and admission control policies.
After increasing URLLC resource allocation, enabling preemption, and tightening QoS parameters, URLLC session success rate and latency improved significantly with minimal acceptable impact on eMBB traffic.

Scenario 6: DL Throughput Degradation with High MCS but Low Rank

OSS Symptoms & Alarms:

PM Counters:
High NMCS.Avg (24–27) but low NMIMO.Rank.Avg (1.2–1.5)

Example (Affected UE Categories):

  • NMCS.DL.Avg β†’ 25.8
  • NMIMO.Rank.Avg β†’ 1.3

Alarms:
No MIMO hardware alarms


Correlation:
Occurs when NUL.SRS.SNR.Avg < 5 dB

Example:

  • Normal condition β†’ 8.1 dB
  • Degraded condition β†’ 4.2 dB

Pattern:
Affects specific UE categories (e.g., Category X)


Expert Troubleshooting Steps:


Step 1: MIMO Performance Analysis

Analyze MIMO and SRS performance correlation using the following counters:

  • NMIMO.Rank.Avg
  • NMCS.DL.Avg
  • NUL.SRS.SNR.Avg
  • NCQI.Avg
  • NTHP.DlUeVol

Example observations (per UE category):

UE Category X

  • NMIMO.Rank.Avg β†’ 1.3
  • NMCS.DL.Avg β†’ 26.2
  • NUL.SRS.SNR.Avg β†’ 4.2 dB
  • NCQI.Avg β†’ 11.8
  • NTHP.DlUeVol β†’ 185 Mbps
  • sample_size β†’ 420 UEs

UE Category Y

  • NMIMO.Rank.Avg β†’ 2.4
  • NMCS.DL.Avg β†’ 25.1
  • NUL.SRS.SNR.Avg β†’ 7.6 dB
  • NCQI.Avg β†’ 13.9
  • NTHP.DlUeVol β†’ 310 Mbps
  • sample_size β†’ 380 UEs

Observation:

  • High MCS values are maintained
  • MIMO rank selection remains low for Category X UEs
  • DL throughput is limited despite good MCS
  • Low SRS SNR correlates strongly with low rank selection

Conclusion:
DL throughput degradation is caused by poor uplink channel sounding quality, not modulation limitation.
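The Category X signature (low rank despite high MCS, tied to poor SRS SNR) can be tested directly. A sketch with the rank-2 floor and 5 dB SNR threshold taken from the correlation noted in the symptoms; the helper is illustrative:

```python
def srs_limited_rank(rank_avg, srs_snr_db, rank_floor=2.0, snr_min_db=5.0):
    """True when average MIMO rank stays low while SRS SNR sits below the
    sounding-quality threshold (the Category X pattern above)."""
    return rank_avg < rank_floor and srs_snr_db < snr_min_db

# Category X (rank 1.3, SRS SNR 4.2 dB) matches; Category Y does not.
```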


Step 2: SRS Configuration Audit

Check SRS configuration for different UE categories:

Parameters audited:

  • srs_bandwidth
  • srs_periodicity
  • srs_max_ports
  • srs_power_control

Example configuration (Category X):

  • srs_bandwidth β†’ BW4
  • srs_periodicity β†’ 20 ms
  • srs_max_ports β†’ 2
  • srs_power_control β†’ Enabled
  • ue_count β†’ 420

Observation:

  • SRS bandwidth too narrow for accurate channel estimation
  • SRS periodicity too long for fast channel variations
  • Limited SRS ports restrict MIMO rank estimation

Conclusion:
SRS configuration is insufficient to support higher MIMO ranks.


Step 3: Channel Correlation Analysis

Analyze channel correlation metrics using:

  • correlation_level
  • rank_selected
  • throughput_mbps
  • mcs

Example observations (last 6 hours):

High Correlation

  • sample_count β†’ 1,250
  • avg_rank β†’ 1.2
  • avg_throughput β†’ 190 Mbps
  • avg_mcs β†’ 26

Medium Correlation

  • sample_count β†’ 980
  • avg_rank β†’ 2.1
  • avg_throughput β†’ 315 Mbps
  • avg_mcs β†’ 25

Low Correlation

  • sample_count β†’ 760
  • avg_rank β†’ 3.4
  • avg_throughput β†’ 420 Mbps
  • avg_mcs β†’ 24

Observation:

  • High channel correlation results in rank-1 or rank-2 selection
  • Lower correlation enables higher MIMO layers and throughput
  • MCS remains high across all correlation levels

Conclusion:
High channel correlation combined with poor SRS quality limits rank adaptation.


Parameter Optimization Strategy:

| Parameter | Pre-Optimization | Post-Optimization | Rationale |
| --- | --- | --- | --- |
| srsBandwidth | BW4 | BW2 | Wider SRS for better channel estimation |
| srsPeriodicity | 20 ms | 5 ms | More frequent SRS for fast-changing channels |
| srsMaxPorts | 2 | 4 | Enable more SRS ports for better MIMO |
| codebookSubsetRestriction | fully-restricted | partially-restricted | Allow more precoding flexibility |
| pmiRiReportPeriodicity | 80 ms | 20 ms | Faster PMI/RI reporting |
| csiRsDensity | one | three | Denser CSI-RS for better channel estimation |

Pre vs Post Optimization Impact:

| KPI | Pre-Optimization | Post-Optimization | Δ | Trend |
| --- | --- | --- | --- | --- |
| Average Rank | 1.3 | 2.8 | +1.5 | Improved |
| DL Throughput (Category X UEs) | 185 Mbps | 420 Mbps | +235 Mbps | Improved |
| SRS SNR | 4.2 dB | 8.5 dB | +4.3 dB | Improved |
| MIMO Layer Utilization | 32% | 68% | +36% | Improved |
| CQI Reporting Accuracy | 65% | 88% | +23% | Improved |

Final Technical Conclusion

The DL throughput degradation occurred due to poor uplink sounding reference quality, which limited accurate MIMO rank estimation despite high MCS values.
After optimizing SRS bandwidth, periodicity, reporting frequency, and CSI-RS density, MIMO rank utilization improved significantly, resulting in substantial DL throughput gains.

Scenario 7: Latency Spikes for Gaming / AR Services (5QI = 79)

OSS Symptoms & Alarms:

PM Counters:
NDelay.UP.E2E.5QI_79.P95 spikes from 25 ms to 65 ms during evening hours

Example (Hourly P95 Latency):

  • 16:00–17:00 β†’ 24 ms
  • 17:00–18:00 β†’ 26 ms
  • 18:00–19:00 β†’ 52 ms
  • 19:00–20:00 β†’ 61 ms
  • 20:00–21:00 β†’ 65 ms

Alarms:
“Packet Delay Threshold Exceeded” for 5QI = 79


Correlation:
High NRLC.ReasTimeout.Sum and NHARQ.Retx.Avg

Example:

  • NRLC.ReasTimeout.Sum
    • Normal hour β†’ 180
    • Peak hour β†’ 1,250
  • NHARQ.Retx.Avg
    • Normal hour β†’ 1.2
    • Peak hour β†’ 3.8

Pattern:
Coincides with peak gaming traffic during 18:00–23:00


Troubleshooting Steps:


Step 1: Latency Component Analysis

Decompose E2E latency by protocol layer using:

  • NDelay.PDCP.Tx.Avg
  • NDelay.RLC.Proc.Avg
  • NDelay.MAC.Sched.Avg
  • NDelay.HARQ.RTT.Avg
  • NDelay.UP.E2E.Avg

Example observations (per minute, peak hour):

  • NDelay.PDCP.Tx.Avg β†’ 3.5 ms
  • NDelay.RLC.Proc.Avg β†’ 18.2 ms
  • NDelay.MAC.Sched.Avg β†’ 14.8 ms
  • NDelay.HARQ.RTT.Avg β†’ 12.0 ms
  • NDelay.UP.E2E.Avg β†’ 64.5 ms
  • gaming_sessions (5QI=79) β†’ 420 active sessions

Observation:

  • RLC processing delay and HARQ RTT dominate E2E latency
  • PDCP delay remains low
  • MAC scheduling delay increases during congestion

Conclusion:
Latency spike is mainly caused by RLC retransmissions and HARQ retries under peak load.
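The decomposition above is a simple share calculation over the per-layer delay counters. A sketch with the peak-hour values; the residual bucket name `other` is an assumption covering delay not captured by these counters:

```python
def latency_shares(components_ms, e2e_ms):
    """Share of E2E latency per measured component; the remainder goes
    under 'other' (transport/core delay not captured by these counters)."""
    shares = {k: v / e2e_ms for k, v in components_ms.items()}
    shares["other"] = 1.0 - sum(shares.values())
    return shares

# Peak-hour per-layer delays from the observations above (ms).
peak = {"pdcp": 3.5, "rlc": 18.2, "mac_sched": 14.8, "harq_rtt": 12.0}
s = latency_shares(peak, 64.5)  # RLC holds the largest measured share
```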


Step 2: Gaming Traffic Pattern Analysis

Analyze gaming traffic characteristics using:

  • five_qi
  • packet_size (P95)
  • packets_per_second
  • inter_arrival_time_ms
  • ue_id

Example observations:

5QI = 79 (Gaming / AR)

  • p95_packet_size β†’ 120 bytes
  • avg_packet_rate β†’ 920 packets/sec
  • avg_inter_arrival β†’ 1.1 ms
  • active_gamers β†’ 420 UEs

5QI = 80

  • p95_packet_size β†’ 220 bytes
  • avg_packet_rate β†’ 410 packets/sec
  • avg_inter_arrival β†’ 3.5 ms
  • active_gamers β†’ 180 UEs

5QI = 6

  • p95_packet_size β†’ 1,200 bytes
  • avg_packet_rate β†’ 95 packets/sec
  • avg_inter_arrival β†’ 12 ms
  • active_gamers β†’ 90 UEs

Observation:

  • 5QI=79 traffic consists of very small packets at very high frequency
  • Highly sensitive to buffering, retransmissions, and scheduling delay

Conclusion:
Default QoS handling is not optimal for bursty, latency-critical gaming traffic.


Step 3: QoS Policy Verification

Check gaming QoS policy configuration using:

  • pdcp_sn_size
  • rlc_mode
  • dl_data_split_threshold
  • scheduling_priority
  • preemption_capability
  • preemption_vulnerability

Example configuration (5QI = 79):

  • pdcp_sn_size β†’ 18 bits
  • rlc_mode β†’ AM
  • dl_data_split_threshold β†’ 100 bytes
  • scheduling_priority β†’ Medium
  • preemption_capability β†’ disabled
  • preemption_vulnerability β†’ preemptable

Observation:

  • RLC AM introduces retransmission delays
  • PDCP SN size adds unnecessary overhead for small packets
  • No semi-persistent scheduling configured

Conclusion:
QoS policy is not tuned for ultra-low latency gaming services.


Parameter Optimization Strategy:

| Parameter | Pre-Optimization | Post-Optimization | Rationale |
| --- | --- | --- | --- |
| pdcpSnSize (5QI=79) | 18 bits | 12 bits | Reduced SN overhead for gaming packets |
| rlcMode (5QI=79) | AM | UM | Eliminate RLC retransmission delay |
| dlDataSplitThreshold | 100 bytes | 50 bytes | Faster transmission of small gaming packets |
| harqMaxRetx (5QI=79) | 4 | 2 | Fewer retransmissions for latency-sensitive traffic |
| spsInterval (5QI=79) | disabled | 10 ms | Semi-persistent scheduling for periodic gaming traffic |
| drxInactivityTimer | 20 ms | 5 ms | Shorter inactivity for responsive gaming |

Pre vs Post Optimization Impact:

| KPI | Pre-Optimization | Post-Optimization | Δ | Impact |
| --- | --- | --- | --- | --- |
| 95th Percentile Latency (5QI=79) | 65 ms | 28 ms | −37 ms | Significant Improvement |
| Packet Delay Variation (Jitter) | 22 ms | 8 ms | −14 ms | Excellent |
| Gaming Packet Loss Rate | 2.1% | 0.4% | −1.7% | Excellent |
| RLC Reassembly Timeouts | 8.5% | 1.2% | −7.3% | Excellent |
| HARQ Round Trip Time | 12 ms | 8 ms | −4 ms | Good |
| Overall Cell Throughput | – | −2% | −2% | Minor Impact |

Final Technical Conclusion

Latency spikes for gaming and AR services (5QI=79) were caused by RLC retransmissions, excessive HARQ retries, and non-optimized QoS policies during peak gaming hours.
After switching to RLC UM, reducing retransmissions, enabling SPS, and optimizing PDCP and DRX parameters, latency and jitter were significantly reduced with only a minor, acceptable impact on overall cell throughput.

Scenario 8: Persistent High DL BLER in Macro Cell

OSS Symptoms & Alarms:

PM Counters:
NBLER.DL.Avg consistently > 15% (threshold: 10%)

Example (Hourly Average):

  • Normal period β†’ 8.5%
  • Degraded period β†’ 16.8%
  • Peak hour β†’ 18.2%

Alarms:
“Radio Link Quality Degraded” alarm active


Correlation:
High NRLC.RetxDL.Sum and low NCQI.Avg

Example:

  • NRLC.RetxDL.Sum
    • Normal β†’ 4,800
    • Degraded β†’ 18,900
  • NCQI.Avg
    • Normal β†’ 10.8
    • Degraded β†’ 8.2

Pattern:
Affects all UEs in sector 2, not localized to specific users or locations


Troubleshooting Steps:


Step 1: BLER Analysis by UE Category & Location

Analyze BLER patterns across UE categories using:

  • NBLER.DL.Avg
  • NCQI.Avg
  • NMCS.DL.Avg
  • NRSRP.DL.Avg
  • NSINR.DL.Avg
  • azimuth_degrees

Example observations (Sector 2):

UE Category 4

  • affected_ues β†’ 180
  • NBLER.DL.Avg β†’ 17.2%
  • NCQI.Avg β†’ 8.0
  • NMCS.DL.Avg β†’ 22.5
  • NRSRP.DL.Avg β†’ βˆ’96 dBm
  • NSINR.DL.Avg β†’ 11.2 dB
  • median_azimuth β†’ 210Β°

UE Category 6

  • affected_ues β†’ 240
  • NBLER.DL.Avg β†’ 16.4%
  • NCQI.Avg β†’ 8.4
  • NMCS.DL.Avg β†’ 23.1
  • NRSRP.DL.Avg β†’ βˆ’95 dBm
  • NSINR.DL.Avg β†’ 10.8 dB
  • median_azimuth β†’ 208Β°

Observation:

  • High BLER across all UE categories
  • BLER not dependent on UE type or specific location
  • RSRP and SINR are moderate, not severely degraded

Conclusion:
Issue is cell-wide link adaptation, not UE-specific radio coverage.


Step 2: Link Adaptation Performance Analysis

Check link adaptation effectiveness using:

  • NBLER.DL.Avg
  • NBLER.Target.Avg
  • NMCS.DL.Avg
  • NCQI.Avg
  • NCQI.ReportingDelay.Avg

Example observations (hourly):

18:00–19:00

  • actual_bler β†’ 17.8%
  • target_bler β†’ 10%
  • mcs_used β†’ 23.0
  • reported_cqi β†’ 8.1
  • cqi_reporting_delay β†’ 95 ms
  • high_bler_samples β†’ 420

19:00–20:00

  • actual_bler β†’ 18.2%
  • target_bler β†’ 10%
  • mcs_used β†’ 23.4
  • reported_cqi β†’ 8.0
  • cqi_reporting_delay β†’ 98 ms
  • high_bler_samples β†’ 460

Observation:

  • Actual BLER significantly exceeds target BLER
  • MCS selection is aggressive despite low CQI
  • CQI feedback is delayed

Conclusion:
Link adaptation loop is not reacting fast enough to channel degradation.
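The slow reaction observed here is normally handled by an outer-loop link adaptation (OLLA) offset on top of CQI. A one-step sketch of the standard update rule; the step sizes and the 10% target mirror the configuration above, and this is a textbook formulation, not Nokia's internal algorithm:

```python
def olla_step(offset_db, ack, target_bler=0.10, step_up_db=0.01):
    """One OLLA update: nudge the CQI/SINR offset up on ACK, down on NACK.
    With step_down = step_up * (1 - target) / target, the long-run BLER
    converges to target_bler."""
    step_down_db = step_up_db * (1 - target_bler) / target_bler
    return offset_db + step_up_db if ack else offset_db - step_down_db

# At a 10% target: +0.01 dB per ACK, -0.09 dB per NACK.
```

If the offset cannot move fast enough (e.g., delayed CQI at ~95 ms), actual BLER overshoots the target exactly as observed.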


Step 3: RF Configuration and Beam Analysis

Analyze RF parameters and beam performance using:

  • param_name
  • current_value
  • recommended_value
  • deviation_percentage
  • impact_on_bler

Example audit findings (Top contributors):

  • pdschTargetBlerDl
    • current β†’ 10%
    • recommended β†’ 5%
    • deviation β†’ +100%
    • impact_on_bler β†’ High
  • cqiTableIndex
    • current β†’ 1 (256QAM)
    • recommended β†’ 2 (64QAM)
    • deviation β†’ Mismatch
    • impact_on_bler β†’ High
  • dlAlpha (OLPC)
    • current β†’ 0.8
    • recommended β†’ 0.6
    • deviation β†’ +33%
    • impact_on_bler β†’ Medium

Observation:

  • BLER target is too aggressive
  • CQI and MCS tables favor high throughput over reliability
  • Power control loop insufficiently conservative

Conclusion:
RF and link adaptation parameters are tuned too aggressively for macro coverage.


Parameter Optimization Strategy:

| Parameter | Pre-Optimization | Post-Optimization | Rationale |
| --- | --- | --- | --- |
| pdschTargetBlerDl | 10% | 5% | More conservative target for better reliability |
| cqiTableIndex | 1 (256QAM) | 2 (64QAM) | Use more robust CQI table |
| mcsTable | 256QAM | 64QAM | Conservative MCS for better BLER |
| dlAlpha (OLPC) | 0.8 | 0.6 | More conservative outer loop power control |
| initialMcsDl | 20 | 15 | Start with lower MCS for new connections |
| csiReportPeriodicity | 80 ms | 40 ms | Faster CSI feedback for better adaptation |

Pre vs Post Optimization Impact:

| KPI | Pre-Optimization | Post-Optimization | Δ | Impact |
| --- | --- | --- | --- | --- |
| Average DL BLER | 16.8% | 7.2% | −9.6% | Excellent |
| RLC DL Retransmissions | 18.5% | 8.2% | −10.3% | Excellent |
| Average CQI | 8.2 | 10.5 | +2.3 | Good |
| DL Throughput | 320 Mbps | 280 Mbps | −40 Mbps | Acceptable Trade-off |
| User Experience (MOS) | 3.2 | 3.9 | +0.7 | Improved |
| RLF Rate | 5.2% | 2.1% | −3.1% | Excellent |

Final Technical Conclusion

The persistent high DL BLER in the macro cell was caused by over-aggressive link adaptation and RF parameter configuration, not by poor coverage or UE limitations.
After adopting more conservative BLER targets, robust CQI/MCS tables, faster CSI feedback, and tuned power control, DL reliability improved significantly with an acceptable throughput trade-off.

Scenario 9: VoNR MOS Score Degradation in Dense Urban

OSS Symptoms & Alarms:

PM Counters:
NMOS.Avg.5QI_1 drops from 4.1 to 3.2

Example (Hourly Average):

  • Normal period β†’ 4.1
  • Degraded period β†’ 3.4
  • Peak degradation β†’ 3.2

Alarms:
β€œVoice Quality Degradation” alarm active for multiple cells


Correlation:
High NPDV.5QI_1.StdDev (>20 ms) and NPacketLoss.5QI_1.Avg (>2%)

Example:

  • NPDV.5QI_1.StdDev
    • Normal β†’ 9 ms
    • Degraded β†’ 25 ms
  • NPacketLoss.5QI_1.Avg
    • Normal β†’ 0.4%
    • Degraded β†’ 2.5%

Pattern:
Affects handover regions between CELL_12, CELL_13, CELL_14


Troubleshooting Steps:


Step 1: VoNR Quality Metrics Correlation

Correlate MOS with underlying metrics using:

  • NMOS.Avg
  • NPacketLoss.5QI_1.Avg
  • NPDV.5QI_1.StdDev
  • NDelay.UP.E2E.5QI_1.Avg
  • NBLER.UL.5QI_1.Avg
  • NROHC.CompressionRatio.Avg

Example observations (last 2 hours):

CELL_12 β†’ CELL_13

  • avg_mos β†’ 3.3
  • packet_loss β†’ 2.4%
  • jitter β†’ 24 ms
  • latency β†’ 38 ms
  • ul_bler β†’ 6.5%
  • rohc_ratio β†’ 1.9:1
  • call_count β†’ 420

CELL_13 β†’ CELL_14

  • avg_mos β†’ 3.2
  • packet_loss β†’ 2.7%
  • jitter β†’ 26 ms
  • latency β†’ 42 ms
  • ul_bler β†’ 7.2%
  • rohc_ratio β†’ 1.8:1
  • call_count β†’ 390

Observation:

  • MOS degradation correlates strongly with jitter and packet loss
  • UL BLER increases during mobility
  • ROHC compression efficiency is low

Conclusion:
VoNR quality degradation is driven by packet loss, jitter, and inefficient header compression, especially during handovers.
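The way loss, jitter, and delay combine into a MOS figure can be sketched with a simplified E-model-style estimate, loosely following ITU-T G.107: start from a clean-channel R-factor, subtract impairments, then map R to MOS. The loss and jitter weights below are assumptions for illustration, not calibrated values:

```python
def approx_mos(packet_loss_pct, jitter_ms, latency_ms):
    """Illustrative MOS estimate in the spirit of the ITU-T G.107
    E-model. Loss/jitter impairment weights are assumptions."""
    r = 93.2                          # default clean-channel R-factor
    r -= 2.5 * packet_loss_pct        # assumed loss impairment weight
    r -= 0.30 * jitter_ms             # assumed jitter impairment weight
    r -= 0.024 * latency_ms           # one-way delay slope from the E-model
    r = max(0.0, min(100.0, r))
    # Standard R-to-MOS mapping for 0 <= R <= 100
    return 1.0 + 0.035 * r + r * (r - 60.0) * (100.0 - r) * 7e-6
```

Feeding in the normal vs. degraded figures from the correlation above (0.4% loss / 9 ms jitter vs. 2.4% loss / 24 ms jitter) reproduces the qualitative drop: MOS falls as loss and jitter rise, which is why those two metrics track the NMOS.Avg.5QI_1 degradation so closely.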


Step 2: Handover Impact on VoNR Quality

Analyze VoNR quality degradation during handovers using:

  • ho_type
  • mos_before_ho
  • mos_after_ho
  • mos_drop
  • ho_interruption_time
  • packet_loss_during_ho

Example observations:

Intra-Freq HO

  • pre_ho_mos β†’ 4.0
  • post_ho_mos β†’ 3.3
  • avg_mos_drop β†’ 0.7
  • interruption_time β†’ 85 ms
  • ho_packet_loss β†’ 2.1%
  • sample_count β†’ 320

Inter-gNB HO

  • pre_ho_mos β†’ 4.1
  • post_ho_mos β†’ 3.2
  • avg_mos_drop β†’ 0.8
  • interruption_time β†’ 110 ms
  • ho_packet_loss β†’ 2.6%
  • sample_count β†’ 280

Observation:

  • MOS drop occurs primarily during HO execution
  • Longer interruption time leads to higher packet loss

Conclusion:
Handover execution time and interruption directly impact VoNR MOS.
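Aggregating the per-HO-type figures above into one cell-level number is a simple sample-weighted average. A sketch (dict keys mirror the fields listed in this step; the helper itself is illustrative):

```python
def weighted_mos_drop(ho_stats):
    """Sample-weighted average MOS drop across handover types,
    the aggregate behind the per-HO-type figures in Step 2."""
    total = sum(s["sample_count"] for s in ho_stats)
    return sum(s["avg_mos_drop"] * s["sample_count"] for s in ho_stats) / total

# With 0.7 drop over 320 intra-freq HOs and 0.8 over 280 inter-gNB HOs,
# the cell-level average drop is ~0.75 MOS per handover.
```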


Step 3: ROHC Performance Analysis

Check ROHC compression efficiency and failures using:

  • NROHC.CompressionRatio.Avg
  • NROHC.FailureRate.Avg
  • NPDU.HeaderSize.Avg
  • NPDU.PayloadSize.Avg

Example observations:

UE Category 3

  • compression_ratio β†’ 1.8:1
  • failure_rate β†’ 6.5%
  • avg_header_size β†’ 42 bytes
  • avg_payload_size β†’ 33 bytes
  • ue_count β†’ 260

UE Category 6

  • compression_ratio β†’ 1.9:1
  • failure_rate β†’ 5.8%
  • avg_header_size β†’ 40 bytes
  • avg_payload_size β†’ 34 bytes
  • ue_count β†’ 310

Observation:

  • Header size comparable to payload size
  • Compression failure rate high for VoNR
  • ROHC contexts insufficient for concurrent calls

Conclusion:
ROHC inefficiency contributes to packet loss and jitter during mobility.
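The compression ratios above are packet-level: only the header shrinks, while the small voice payload passes through untouched, so header cost dominates. A sketch of the arithmetic (the 3-byte compressed header below assumes a healthy ROHC context, which is an illustrative figure):

```python
def rohc_packet_ratio(header_bytes, payload_bytes, compressed_header_bytes):
    """Packet-level ROHC compression gain: original packet size over
    compressed packet size. The payload is untouched by header
    compression, so small voice payloads magnify header overhead."""
    original = header_bytes + payload_bytes
    compressed = compressed_header_bytes + payload_bytes
    return original / compressed

# 42-byte header + 33-byte payload compressed to a ~3-byte header
# would yield about 2.1:1 -- the observed 1.8:1 implies contexts are
# frequently falling back to larger header formats.
```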


Parameter Optimization Strategy:

| Parameter | Pre-Optimization | Post-Optimization | Rationale |
|---|---|---|---|
| rohcMaxCid | 5 | 15 | More compression contexts for concurrent VoNR calls |
| rohcProfile | 0x0001 | 0x0006 | Use optimized profile for voice traffic |
| ttiBundling (5QI=1) | disabled | enabled | TTI bundling for better UL coverage in voice |
| ulTargetBler (5QI=1) | 10% | 1% | Ultra-low BLER target for voice |
| spsInterval (5QI=1) | disabled | 20 ms | SPS for consistent voice packet scheduling |
| hoExecutionTimer | 1000 ms | 500 ms | Faster handover execution for voice |

Pre vs Post Optimization Impact:

| KPI | Pre-Optimization | Post-Optimization | Ξ” | Impact |
|---|---|---|---|---|
| Average MOS Score | 3.2 | 4.0 | +0.8 | Excellent |
| Packet Loss Rate (5QI=1) | 2.5% | 0.3% | βˆ’2.2% | Excellent |
| Jitter (Packet Delay Variation) | 25 ms | 8 ms | βˆ’17 ms | Excellent |
| ROHC Compression Ratio | 1.8:1 | 3.5:1 | +1.7 | Excellent |
| Handover MOS Drop | 0.8 | 0.2 | βˆ’0.6 | Excellent |
| VoNR Call Drop Rate | 1.8% | 0.4% | βˆ’1.4% | Excellent |

Final Technical Conclusion

The VoNR MOS degradation in dense urban areas was caused by handover-induced packet loss, high jitter, UL BLER, and inefficient ROHC compression.
By optimizing ROHC contexts, enabling SPS and TTI bundling, tightening UL BLER targets, and reducing HO execution time, VoNR quality was restored to near-ideal levels across all affected cells.

Scenario 10: Latency Optimization for Industrial IoT (URLLC)

OSS Symptoms & Alarms:

PM Counters:
NDelay.UP.E2E.5QI_80.P99 > 50 ms (requirement: 20 ms)

Example (Latency Distribution):

  • Normal period β†’ 18 ms
  • Degraded period β†’ 52 ms
  • Peak violation β†’ 58 ms

Alarms:
β€œURLLC Service Level Agreement Violation”


Correlation:
High NPDCP.ReorderingDelay.Avg and increased scheduling delays

Example:

  • NPDCP.ReorderingDelay.Avg
    • Normal β†’ 2.5 ms
    • Degraded β†’ 12.8 ms
  • Scheduling delay (5QI=80)
    • Normal β†’ 3.2 ms
    • Degraded β†’ 14.5 ms

Pattern:
Affects specific time-critical industrial applications (robot control, motion control)


Troubleshooting Steps:


Step 1: URLLC Traffic Pattern Analysis

Analyze URLLC traffic characteristics using:

  • packet_size_bytes
  • packets_per_second
  • e2e_delay (P99 / P99.9)
  • reliability_percentage
  • transaction_count

Example observations (last 1 hour):

Motion Control Application

  • avg_packet_size β†’ 64 bytes
  • packet_rate β†’ 1,200 packets/sec
  • p99_latency β†’ 55 ms
  • p999_latency β†’ 82 ms
  • reliability β†’ 99.92%
  • transaction_count β†’ 18,500

PLC Control Application

  • avg_packet_size β†’ 72 bytes
  • packet_rate β†’ 980 packets/sec
  • p99_latency β†’ 48 ms
  • p999_latency β†’ 74 ms
  • reliability β†’ 99.94%
  • transaction_count β†’ 15,200

Observation:

  • Very small packets with extremely high frequency
  • Tail latency (P99, P99.9) violates URLLC SLA
  • Reliability slightly below URLLC target

Conclusion:
Latency spikes are driven by tail latency accumulation, not average delay.
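Since the SLA is defined on P99/P99.9 rather than the mean, the relevant statistic is a percentile read off the per-packet delay distribution. A nearest-rank percentile sketch (one common convention; OSS tools may interpolate differently):

```python
import math

def nearest_rank_percentile(samples, p):
    """Nearest-rank percentile: the smallest sample such that at
    least p% of samples are at or below it -- the usual way P99/P99.9
    tail latency is read from a delay distribution."""
    ordered = sorted(samples)
    k = max(0, math.ceil(p / 100.0 * len(ordered)) - 1)
    return ordered[k]
```

On a distribution like the one above, the mean can sit comfortably under 20 ms while P99 violates the SLA, which is why averaging hides the problem entirely.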


Step 2: Scheduling Priority Analysis

Check scheduling behavior for URLLC traffic using:

  • scheduling_delay_5qi_80
  • scheduling_delay_5qi_9
  • priority_weight_5qi_80
  • preemption_count_5qi_80

Example observations (INDUSTRIAL_CELL_01):

Scheduler: Proportional Fair

  • urllc_sched_delay β†’ 14.2 ms
  • embb_sched_delay β†’ 6.8 ms
  • urllc_priority β†’ 0.35
  • urllc_preemptions β†’ 2
  • unique_ues_scheduled β†’ 46

Scheduler: QoS-Aware

  • urllc_sched_delay β†’ 6.1 ms
  • embb_sched_delay β†’ 8.9 ms
  • urllc_priority β†’ 0.75
  • urllc_preemptions β†’ 9
  • unique_ues_scheduled β†’ 44

Observation:

  • URLLC traffic not consistently prioritized
  • Insufficient preemption of eMBB traffic
  • Scheduler behavior contributes to latency tail

Conclusion:
Scheduling priority for URLLC is insufficient during congestion.
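The difference between the two schedulers above can be sketched as a scheduling metric: a proportional-fair term scaled by a per-5QI priority weight, with the highest-metric UE winning the PRBs each TTI. Field names and weights are illustrative assumptions, not the gNB scheduler's actual internals:

```python
def pick_scheduled_ue(candidates):
    """QoS-aware scheduling sketch: proportional-fair term
    (instantaneous over average rate) scaled by a per-5QI weight;
    the UE with the highest metric is scheduled this TTI."""
    def metric(ue):
        pf = ue["inst_rate"] / max(ue["avg_rate"], 1e-9)
        return pf * ue["qos_weight"]
    return max(candidates, key=metric)
```

With pure proportional fair (all weights equal), a favorable-channel eMBB UE beats a URLLC UE every time; raising the URLLC weight (0.35 β†’ 0.75 in the observations above) flips that outcome, which is exactly what the QoS-aware figures show.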


Step 3: End-to-End Delay Breakdown

Break down URLLC latency components using:

  • delay_component
  • delay_ms (Avg / P95 / P99)

Example observations (last 30 minutes):

PDCP Reordering

  • avg_delay β†’ 10.5 ms
  • p95_delay β†’ 18 ms
  • p99_delay β†’ 25 ms
  • sample_count β†’ 9,800

MAC Scheduling

  • avg_delay β†’ 12.8 ms
  • p95_delay β†’ 22 ms
  • p99_delay β†’ 31 ms
  • sample_count β†’ 9,800

HARQ Processing

  • avg_delay β†’ 6.2 ms
  • p95_delay β†’ 10 ms
  • p99_delay β†’ 14 ms
  • sample_count β†’ 9,800

Observation:

  • PDCP reordering and MAC scheduling dominate tail latency
  • Combined delays exceed URLLC SLA at P99

Conclusion:
End-to-end URLLC latency violation is caused by scheduler delay + PDCP reordering.
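A quick way to confirm this conclusion from the breakdown above is a latency-budget check. Summing per-component P99s only upper-bounds the end-to-end P99 (the worst cases rarely align perfectly), but it is a useful conservative screen; the half-SLA flag below is an illustrative heuristic:

```python
def latency_budget(components_p99_ms, sla_ms):
    """Conservative E2E check: sum per-component P99 delays and flag
    any single component consuming more than half the SLA budget."""
    total = sum(components_p99_ms.values())
    heavy = {c: d for c, d in components_p99_ms.items() if d > sla_ms / 2}
    return total, total <= sla_ms, heavy
```

With PDCP reordering at 25 ms and MAC scheduling at 31 ms against a 20 ms SLA, each of those two components alone already exceeds the entire budget, confirming where the optimization effort must go.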


Parameter Optimization Strategy:

| Parameter | Pre-Optimization | Post-Optimization | Rationale |
|---|---|---|---|
| pdcpDuplication (5QI=80) | disabled | enabled | Packet duplication for ultra-reliability |
| maxHarqTx (5QI=80) | 4 | 8 | More HARQ retransmissions for reliability |
| logicalChannelGroup (5QI=80) | 1 | 0 | Highest scheduling priority |
| prioritisedBitRate (5QI=80) | 0 | 1000 kbps | Guaranteed bit rate for URLLC |
| bucketSizeDuration (5QI=80) | 100 ms | 10 ms | Smaller bucket for bursty URLLC traffic |
| schedulingRequestId (5QI=80) | 1 | 0 | Highest priority SR |

Pre vs Post Optimization Impact:

| KPI | Pre-Optimization | Post-Optimization | Ξ” | Impact |
|---|---|---|---|---|
| 99th Percentile Latency (5QI=80) | 52 ms | 18 ms | βˆ’34 ms | Excellent |
| 99.9th Percentile Latency | 85 ms | 25 ms | βˆ’60 ms | Exceptional |
| Reliability (1 βˆ’ Packet Loss) | 99.9% | 99.999% | +0.099% | Excellent |
| PDCP Duplication Overhead | 0% | 100% | +100% | High Cost |
| eMBB Throughput Impact | 0% | βˆ’15% | βˆ’15% | Acceptable |
| URLLC SLA Compliance | 65% | 98% | +33% | Excellent |

Final Technical Conclusion

The URLLC latency SLA violation was caused by scheduler prioritization gaps and PDCP reordering delays, which primarily impacted tail latency (P99 / P99.9).
By enabling PDCP duplication, enforcing strict scheduling priority, increasing HARQ reliability, and optimizing bucket and SR parameters, URLLC latency and reliability were restored to industrial-grade requirements with an acceptable trade-off on eMBB throughput.

Scenario 5: BLER Optimization for Massive MIMO Cells

OSS Symptoms & Alarms:

PM Counters:
Sector-specific high BLER in NBLER.DL.Beam_X.Avg

Example (Top Impacted Beams):

  • Beam 7 β†’ 18.6%
  • Beam 11 β†’ 17.9%
  • Beam 14 β†’ 16.8%
  • Other beams β†’ < 9%

Alarms:
β€œBeam Quality Degradation” on specific beams


Correlation:
Low NMIMO.Rank.Avg and poor NCQI.Beam_X.Avg

Example:

  • NMIMO.Rank.Avg
    • Normal beams β†’ 3.2
    • Affected beams β†’ 1.9
  • NCQI.Beam_7.Avg β†’ 7.4
  • NCQI.Beam_11.Avg β†’ 7.1

Pattern:
Affects users located in specific angular sectors


Expert Troubleshooting Steps:


Step 1: Beam-Specific Performance Analysis

Analyze performance by beam index using:

  • NBLER.DL.Beam.Avg
  • NRSRP.Beam.Avg
  • NSINR.Beam.Avg
  • NMIMO.Rank.Beam.Avg
  • NTHP.DL.Beam.Avg

Example observations (MIMO_CELL_03):

Beam 7

  • azimuth_degrees β†’ 110Β°
  • elevation_degrees β†’ 6Β°
  • avg_bler β†’ 18.6%
  • avg_beam_rsrp β†’ βˆ’98 dBm
  • avg_beam_sinr β†’ 10.5 dB
  • avg_beam_rank β†’ 1.8
  • served_ues β†’ 95
  • beam_throughput β†’ 85 Mbps

Beam 11

  • azimuth_degrees β†’ 165Β°
  • elevation_degrees β†’ 7Β°
  • avg_bler β†’ 17.9%
  • avg_beam_rsrp β†’ βˆ’97 dBm
  • avg_beam_sinr β†’ 11.0 dB
  • avg_beam_rank β†’ 2.0
  • served_ues β†’ 102
  • beam_throughput β†’ 92 Mbps

Observation:

  • High BLER is beam-specific, not cell-wide
  • SINR is moderate but rank selection is conservative
  • Throughput per beam is significantly degraded

Conclusion:
BLER degradation is linked to beam-level MIMO behavior, not RF coverage.
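The "beam-specific, not cell-wide" finding above can be automated by flagging beams whose BLER sits well above the cell-wide mean. A sketch (the 3-point margin is an illustrative threshold, not a Nokia default):

```python
def outlier_beams(beam_bler_pct, margin_pct=3.0):
    """Flag beams whose DL BLER exceeds the cell-wide mean by a
    margin, separating beam-specific faults from cell-wide
    degradation."""
    mean = sum(beam_bler_pct.values()) / len(beam_bler_pct)
    return sorted(b for b, v in beam_bler_pct.items() if v > mean + margin_pct)
```

Run against the top-impacted beams in the symptoms (7, 11, 14 near 17–19% against other beams under 9%), this isolates exactly those three indices.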


Step 2: MIMO Configuration Audit

Check MIMO and beamforming configuration using:

  • config_parameter
  • current_value
  • recommended_value
  • compliance_status

Example audit results:

  • codebookSubsetRestriction
    • current β†’ fully-restricted
    • recommended β†’ partially-restricted
    • compliance_status β†’ NON-COMPLIANT
  • csiRsDensity
    • current β†’ one
    • recommended β†’ three
    • compliance_status β†’ NON-COMPLIANT
  • beamReportingPeriodicity
    • current β†’ 160 ms
    • recommended β†’ 40 ms
    • compliance_status β†’ NON-COMPLIANT
  • rankIndicatorRestriction
    • current β†’ rank-4-allowed
    • recommended β†’ rank-2-only
    • compliance_status β†’ NON-COMPLIANT

Observation:

  • CSI and beam reporting too sparse for fast channel variation
  • Precoding flexibility is restricted
  • Rank selection not optimized for BLER stability

Conclusion:
MIMO configuration is over-optimized for peak throughput, causing BLER instability.
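The audit above is a straight comparison of live values against a golden baseline. A sketch of the compliance check (parameter values taken from this step; the helper itself is illustrative):

```python
def compliance_report(live_config, golden_config):
    """Compare live MIMO/beamforming parameters against a golden
    baseline and mark deviations NON-COMPLIANT, as in the Step 2
    audit."""
    return {
        param: "COMPLIANT" if live_config.get(param) == expected else "NON-COMPLIANT"
        for param, expected in golden_config.items()
    }
```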


Step 3: Channel Correlation Analysis

Analyze channel correlation for MIMO performance using:

  • correlation_level
  • selected_rank
  • throughput_mbps
  • bler_percentage
  • ue_speed_kmh

Example observations (last 6 hours):

High Correlation

  • sample_count β†’ 1,150
  • avg_selected_rank β†’ 1.8
  • avg_throughput β†’ 210 Mbps
  • avg_bler β†’ 18.2%
  • avg_ue_speed β†’ 12 km/h

Medium Correlation

  • sample_count β†’ 980
  • avg_selected_rank β†’ 2.4
  • avg_throughput β†’ 295 Mbps
  • avg_bler β†’ 11.5%
  • avg_ue_speed β†’ 18 km/h

Low Correlation

  • sample_count β†’ 760
  • avg_selected_rank β†’ 3.1
  • avg_throughput β†’ 380 Mbps
  • avg_bler β†’ 6.2%
  • avg_ue_speed β†’ 25 km/h

Observation:

  • High channel correlation leads to low rank and high BLER
  • Rank selection improves as correlation decreases

Conclusion:
Channel correlation directly impacts MIMO efficiency and BLER.
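The trend in the three correlation buckets above can be captured as a simple correlation-to-rank mapping. The thresholds below are assumptions chosen only to match that trend, not values from any rank-selection algorithm:

```python
def usable_rank(correlation):
    """Illustrative mapping from spatial channel correlation to the
    MIMO rank the channel can realistically support. Thresholds are
    assumptions, not gNB rank-selection logic."""
    if correlation > 0.7:   # highly correlated paths: almost no spatial diversity
        return 1
    if correlation > 0.4:   # medium correlation: dual-layer still viable
        return 2
    return 4                # low correlation: full spatial multiplexing
```

Forcing rank above what the correlation supports is exactly what drives the 18.2% BLER seen in the high-correlation bucket, which motivates the rank restriction in the optimization below.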


Parameter Optimization Strategy:

| Parameter | Pre-Optimization | Post-Optimization | Rationale |
|---|---|---|---|
| codebookSubsetRestriction | fully-restricted | partially-restricted | More precoding flexibility |
| csiRsDensity | one | three | Denser CSI-RS for better channel estimation |
| beamReportingPeriodicity | 160 ms | 40 ms | Faster beam reporting for mobility |
| rankIndicatorRestriction | rank-4-allowed | rank-2-only | Conservative rank for better BLER |
| pmiRiReportPeriodicity | 80 ms | 20 ms | Faster PMI/RI reporting |
| srsBandwidth | BW4 | BW8 | Wider SRS for better UL channel estimation |

Pre vs Post Optimization Impact:

| KPI | Pre-Optimization | Post-Optimization | Ξ” | Impact |
|---|---|---|---|---|
| Average DL BLER | 14.2% | 6.8% | βˆ’7.4% | Excellent |
| MIMO Rank Utilization | 2.8 | 2.2 | βˆ’0.6 | Acceptable |
| Beam Switching Success Rate | 88% | 96% | +8% | Good |
| CSI Reporting Accuracy | 72% | 89% | +17% | Excellent |
| Cell Throughput | 850 Mbps | 720 Mbps | βˆ’130 Mbps | Trade-off |
| User Consistency Index | 65% | 82% | +17% | Excellent |

Final Technical Conclusion

The high BLER in the Massive MIMO cell was caused by beam-specific MIMO misconfiguration and high channel correlation, not by coverage or hardware faults.
By increasing CSI-RS density, improving reporting periodicity, relaxing precoding restrictions, and enforcing conservative rank selection, BLER and user consistency improved significantly with an acceptable throughput trade-off.