summaryrefslogtreecommitdiffstats
path: root/book-enea-nfv-core-release-info/doc/known_bugs_and_limitations.xml
blob: bc682aba87042fa78c2f79ac38228d6ff51d5afd (plain)
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
101
102
103
104
105
106
107
108
109
110
111
112
113
114
115
116
117
118
119
120
121
122
123
124
125
126
127
128
129
130
131
132
133
134
135
136
137
138
139
140
141
142
143
144
145
146
147
148
149
150
151
152
153
154
155
156
157
158
159
160
161
162
163
164
165
166
167
168
169
170
171
172
173
174
175
176
177
178
179
180
181
182
183
184
185
186
187
188
189
190
191
192
193
194
195
196
197
198
199
200
201
202
203
204
205
206
207
208
209
210
211
212
213
214
215
216
217
218
219
220
221
222
223
224
225
226
227
228
229
230
231
232
233
234
235
236
237
238
239
240
241
242
243
244
245
246
247
248
249
250
251
252
253
254
255
256
257
258
259
260
261
262
263
264
265
266
267
268
269
270
271
272
273
274
275
276
277
278
279
280
281
282
283
284
285
286
287
288
289
290
291
292
293
294
295
296
297
298
299
300
301
302
303
304
305
306
307
308
309
310
311
312
313
314
315
316
317
318
319
320
321
322
323
324
325
326
327
328
329
330
331
332
333
334
335
336
337
338
339
340
341
342
343
344
345
346
347
348
349
350
351
352
353
354
355
356
357
358
359
360
361
362
363
364
365
366
367
368
369
370
371
372
373
374
375
376
377
378
379
380
381
382
383
384
385
386
387
388
389
390
391
392
393
394
395
396
397
398
399
400
401
402
403
404
405
406
407
408
409
410
411
412
413
414
415
416
417
418
419
420
421
422
423
424
425
426
427
428
429
430
431
432
433
434
435
436
437
438
439
440
441
442
443
444
445
446
447
448
449
450
451
452
453
454
455
456
457
458
459
460
461
462
463
464
465
466
467
468
469
470
471
472
473
<?xml version="1.0" encoding="ISO-8859-1"?>
<!DOCTYPE chapter PUBLIC "-//OASIS//DTD DocBook XML V4.2//EN"
"http://www.oasis-open.org/docbook/xml/4.2/docbookx.dtd">
<chapter id="bugs-limitations">
  <title>Known Issues and Limitations</title>

  <para>Problems and limitations relevant to the current release are detailed
  in this chapter. Corrections to bugs detected by Enea are submitted
  upstream, and remaining issues are listed below.<remark>INFO: The <emphasis
  role="bold">Release-Specific Problems</emphasis> section further down is
  generated from JIRA with gen_known_issues.py, but that script is HARDCODED
  with affectedversion "Enea NFV 1.0.1" and needs to be adapted when a release
  info for another ENFV version changes.</remark></para>

  <section id="bugs-limitations-gen">
    <title>Release Specific Issues</title>

    <orderedlist>
      <listitem>
        <para>Live migration fails on ThunderX nodes with different CPU
        Revisions</para>

        <itemizedlist>
          <listitem>
            <para>Description and Impact:</para>

            <para>For an aarch64 architecture, QEMU describes a CPU in terms
            of Vendor, Model and Revision number in contrast with the x86
            architecture, where CPU supported features (e.g. the instruction
            set supported) are used.</para>

            <para>QEMU lacks support in describing a CPU in terms of features
            for an aarch64 architecture, causing the migration of VMs between
            hosts with different CPU revisions to fail.</para>
          </listitem>

          <listitem>
            <para>Workaround:</para>

            <para>In the deployment pod use compute nodes with the same
            Vendor, Model, and Revision number CPUs.</para>
          </listitem>
        </itemizedlist>
      </listitem>

      <listitem>
        <para>Instances fail to boot when using a direct port (SR-IOV) on
        ThunderX</para>

        <itemizedlist>
          <listitem>
            <para>Description and Impact:</para>

            <para>Deployment is successful with SR-IOV enabled interfaces
            however, instances fail to boot when a direct bound (SR-IOV) port
            is added. This has been tested using a SR-IOV capable PCI Express
            Network Interface. As a consequence it is impossible to
            passthrough a SR-IOV port on ThunderX.</para>
          </listitem>

          <listitem>
            <para>Workaround: N/A.</para>
          </listitem>
        </itemizedlist>
      </listitem>

      <listitem>
        <para>Security groups are not working correctly for ICMP traffic in
        deployments with OpenDaylight.</para>

        <itemizedlist>
          <listitem>
            <para>Description and Impact:</para>

            <itemizedlist>
              <listitem>
                <para>When OPNFV is deployed with OpenDaylight as an SDN
                controller, the Security Groups rules pertaining to ICMP do
                not work as expected. The OpenFlow rules describing the ICMP
                rules are inconsistent, so VMs can be pinged even when this is
                not desired.</para>
              </listitem>

              <listitem>
                <para>This reproduces on aarch64. On x86 the security groups
                work correctly.</para>
              </listitem>
            </itemizedlist>
          </listitem>

          <listitem>
            <para>Workaround: N/A.</para>
          </listitem>
        </itemizedlist>
      </listitem>

      <listitem>
        <para>Virtual instances do not get IP from DHCP in SFC scenarios with
        ODL</para>

        <itemizedlist>
          <listitem>
            <para>Description and Impact:</para>

            <itemizedlist>
              <listitem>
                <para>After a fresh deploy of OPNFV with OpenDaylight and the
                Service Function Chaining scenario configurations, instances
                fail to get IP from DHCP, due to OpenDaylight
                malfunctioning.</para>
              </listitem>

              <listitem>
                <para>The SFC VNFs are not reachable via SSH for management
                and configuration.</para>
              </listitem>
            </itemizedlist>
          </listitem>

          <listitem>
            <para>Workaround: Restarting OpenDaylight via
            <literal>systemctl</literal> fixes the problem.</para>
          </listitem>
        </itemizedlist>
      </listitem>

      <listitem>
        <para>Scale-in and Scale-out operations are not supported</para>

        <itemizedlist>
          <listitem>
            <para>Description and Impact:</para>

            <itemizedlist>
              <listitem>
                <para>When adding or removing nodes via Fuel installer on an
                OPNFV deployment that uses OpenDaylight, flows in
                <filename>ovs</filename> tables on the existing and new nodes
                are not updated properly. Similarly, OpenvSwitch ports
                interfaces are not removed when deleting nodes.</para>
              </listitem>

              <listitem>
                <para>As a consequence, when adding a compute node to an
                existing OPNFV deployment, the instances running on the new
                compute node cannot be reached by floating IP. Scale-out and
                scale-in operations are flawed.</para>
              </listitem>
            </itemizedlist>
          </listitem>

          <listitem>
            <para>Workaround: The rules can be manually configured, but this
            is not scalable.</para>
          </listitem>
        </itemizedlist>
      </listitem>

      <listitem>
        <para>Hot-plugging network interfaces does not work in aarch64
        guests</para>

        <itemizedlist>
          <listitem>
            <para>Description and Impact:</para>

            <itemizedlist>
              <listitem>
                <para>When the user tries to hot-plug a network port to a
                running instance, this fails due to virtio negotiation failure
                between the host and guest. This happens when the VM is using
                a Linux Kernel version prior to 4.10. Port addition or
                deletion is not possible.</para>
              </listitem>

              <listitem>
                <para>This reproduces on aarch64.</para>
              </listitem>
            </itemizedlist>
          </listitem>

          <listitem>
            <para>Workaround: The user needs to upgrade the guest Linux Kernel
            to 4.10 or later. In case of port addition, the user also needs to
            manually set the link up and trigger the
            <literal>dhclient</literal> request.</para>
          </listitem>
        </itemizedlist>
      </listitem>

      <listitem>
        <para>Fuel Healthcheck Configuration tests fail when default
        credentials are used</para>

        <itemizedlist>
          <listitem>
            <para>Description and Impact:</para>

            <itemizedlist>
              <listitem>
                <para>The Configuration tests from the Fuel Healthcheck fail
                when the OPNFV cluster operates with the default credentials.
                This has no impact on the overall cluster
                functionality.</para>
              </listitem>

              <listitem>
                <para>This is architecture independent.</para>
              </listitem>
            </itemizedlist>
          </listitem>

          <listitem>
            <para>Workaround: In order to make these 3 test cases pass, the
            user needs to:</para>

            <itemizedlist>
              <listitem>
                <para>Change the credentials for the root user on the Fuel
                Master node: after logging on as root on the master node,
                issue command <command>passwd</command> and provide the new
                password.</para>
              </listitem>

              <listitem>
                <para>Change the default credentials for the OpenStack
                cluster: in Fuel Dashboard, for an Environment, change
                Settings|General|OpenStack Access &gt; Password to something
                other than the default.</para>
              </listitem>

              <listitem>
                <para>Change the default credentials for Keystone on the
                master node: in Fuel Dashboard, click on the user avatar and
                select <literal>Change Password</literal>. This should be done
                before deploy or you will need to redeploy to have the changes
                applied.</para>
              </listitem>
            </itemizedlist>
          </listitem>
        </itemizedlist>
      </listitem>

      <listitem>
        <para>Fuel Healthcheck Stack update test fails</para>

        <itemizedlist>
          <listitem>
            <para>Description and Impact:</para>

            <para>The Platform test case number 5 (Update stack) from the Fuel
            Healthcheck sometimes fails. This has no impact on the overal
            cluster functionality.</para>
          </listitem>

          <listitem>
            <para>Workaround: N/A.</para>
          </listitem>
        </itemizedlist>
      </listitem>

      <listitem>
        <para>Issue #1 with Openstack Resource Agents and Compute Fencing
        functionality</para>

        <itemizedlist>
          <listitem>
            <para>Description and Impact:</para>

            <para>In an OPNFV deployment that uses Openstack Resource Agents,
            the neutron-openvswitch-agent is killed by Pacemaker when booting,
            due to Pacemaker misconfiguration.</para>
          </listitem>

          <listitem>
            <para>Workaround:</para>

            <para>Starting the <literal>systemd</literal> service manually
            makes it run successfully. Enea NFV Core 1.0.1 is shipped without
            Openstack Resource Agents, therefore this issue should not affect
            the user.</para>
          </listitem>
        </itemizedlist>
      </listitem>

      <listitem>
        <para>Issue #2 with Openstack Resource Agents and Compute Fencing
        functionality</para>

        <itemizedlist>
          <listitem>
            <para>Description and Impact:</para>

            <para>In an OPNFV deployment that uses Openstack Resource Agents,
            when we configure the <literal>fence_compute</literal> as a
            Pacemaker resource, the Controller nodes start to reboot each
            other endlessly.</para>
          </listitem>

          <listitem>
            <para>Workaround:</para>

            <para>Enea NFV Core 1.0.1 is shipped without Openstack Resource
            Agents, therefore this issue should not affect the user.</para>
          </listitem>
        </itemizedlist>
      </listitem>

      <listitem>
        <para>Virtual instances are not affected by removing a node from the
        Ceph Storage Cluster</para>

        <itemizedlist>
          <listitem>
            <para>Description and Impact:</para>

            <para>Engineering wanted to validate the survival of storage
            systems when a single disk is removed, without causing data loss.
            Without physical access to the test setup, this test is not
            feasible.</para>
          </listitem>

          <listitem>
            <para>Workaround:</para>

            <itemizedlist>
              <listitem>
                <para>The chosen approach was to validate what happens to the
                Ceph cluster, when network connectivity is lost for the
                Storage interface of one of the nodes. No impact was observed
                when running an instance using Ceph for volume storage.</para>
              </listitem>

              <listitem>
                <para><ulink
                url="http://ceph.com/geen-categorie/admin-guide-replacing-a-failed-disk-in-a-ceph-cluster/">Reference
                information</ulink></para>
              </listitem>
            </itemizedlist>
          </listitem>
        </itemizedlist>
      </listitem>

      <listitem>
        <para>Offline Deploy with Fuel fails at times</para>

        <itemizedlist>
          <listitem>
            <para>Description and Impact:</para>

            <para>The Postfix installation and post-install configuration
            sometimes take longer when the Fuel deployment is performed
            without Internet connectivity. The <emphasis
            role="bold">tools</emphasis> Puppet task has a 5 minute completion
            timeout, causing there to not be enough time left for the other
            components of the tools task to be executed.</para>
          </listitem>

          <listitem>
            <para>Workaround:</para>

            <para>Simply press <literal>Deploy changes</literal> again. Since
            by now Postfix is installed, it will skip the installation and
            finish in time.</para>
          </listitem>
        </itemizedlist>
      </listitem>

      <listitem>
        <para>Volumes fail to attach during the first few seconds of a VM
        booting procedure</para>

        <itemizedlist>
          <listitem>
            <para>Description and Impact:</para>

            <para>There is a short period of a few seconds after a VM stated
            the boot procedure when attaching an external volume will result
            in an error to claim the qemu assigned bus in the guest Image
            kernel.</para>

            <para>Disks attached in the first few seconds of the booting
            procedure of a VM, will fail to register in the guest VM.</para>
          </listitem>

          <listitem>
            <para>Workaround:</para>

            <para>Attach volumes to a VM only before the VM is booting or
            after the period of a few seconds has passed since the booting
            procedure started.</para>
          </listitem>
        </itemizedlist>
      </listitem>

      <listitem>
        <para>Fuel Healthcheck Stack creation with wait condition test,
        fails</para>

        <itemizedlist>
          <listitem>
            <para>Description and Impact:</para>

            <para>The platform test case (create stack with wait condition)
            from the Fuel Healthcheck, fails. This has no impact on overall
            cluster functionality.</para>
          </listitem>

          <listitem>
            <para>Workaround: N/A.</para>
          </listitem>
        </itemizedlist>
      </listitem>

      <listitem>
        <para>On Mixed Arch Deployment, only the aarch64 TestVM Cirros image
        will be installed by Fuel</para>

        <itemizedlist>
          <listitem>
            <para>Description and Impact:</para>

            <para>Due to the fact that Fuel will only deploy the aarch64
            image, Yardstick, Functest, and certain Health Check tests will
            not work. These test suites are dependent on a single image name
            at a time, and do not know on how to place instances on the
            Compute for images that each require a different arch.</para>

            <para>To have both testVM images, the user must add the x86_64
            image manually.</para>
          </listitem>

          <listitem>
            <para>There is no workaround for the test suites failures.</para>
          </listitem>
        </itemizedlist>
      </listitem>
    </orderedlist>
  </section>

  <section condition="hidden" id="bugs-limitations-doc">
    <title>Documentation</title>

    <itemizedlist spacing="compact">
      <listitem>
        <para><emphasis role="bold">PDF navigation</emphasis>: When using
        links to open other PDFs, or jump to another place in the same PDF,
        jumping back sometimes fails. This has been observed when opening a
        PDF in Adobe Reader, inside a browser with PDF add-on, as well as when
        the browser is configured to open PDF files in an external PDF reader.
        As a workaround, open the HTML version of the
        document.<remark>LXCR-3283</remark></para>
      </listitem>
    </itemizedlist>
  </section>

  <section condition="hidden" id="bugs-limitations-other">
    <title>Miscellaneous</title>

    <itemizedlist spacing="compact">
      <listitem>
        <para></para>

        <remark>list all misc problems here.</remark>
      </listitem>
    </itemizedlist>
  </section>

  <!-- The file with a section below is autocreated by make init

  <xi:include href="jiraissues_generated.xml"
              xmlns:xi="http://www.w3.org/2001/XInclude" /> -->
</chapter>