watcher

Author	SHA1	Message	Date
Matt Riedemann	9c9f336f10	Configure nova notification_format for grenade Nova changed the default notification_format from "both" to "unversioned" in Train [1]. Without configuring nova in the grenade job we are not testing the nova versioned notification handler code during upgrades. Note that grenade only runs stack.sh on the base (old) side so this change has to depend on a devstack stable/stein change to add the NOVA_NOTIFICATION_FORMAT variable that we override. Closes-Bug: #1831917 Depends-On: Ied9d50b07c368d5c2be658c744f340a8d1ee41e0 [1] https://review.opendev.org/603079/ Change-Id: I94c2d14477da185310e0fec596a1ad6436b802f1	2019-06-25 11:06:43 -04:00
Matt Riedemann	966a4dfa5f	Configure nova notification format in non-grenade CI jobs Nova used to emit versioned and unversioned notiifcations by default but that changed in https://review.opendev.org/603079/ so now nova emits only unversioned notifications by default. Watcher listens for versioned notifications so we need to configure nova to emit both versioned (for Watcher) and unversioned (for Ceilometer) notifications explicitly. This adds an override-defaults file so devstack will load up the nova devstack variable to set the notification_format before importing and stacking the nova lib script. Note that this only fixes the non-grenade CI jobs since grenade requires separate handling for overriding defaults which is proving hard to do and will be addressed in a separate change. Partial-Bug: #1831917 Change-Id: I7e441608b38338eecd80e663ed3abe66a89e504f	2019-06-24 13:09:34 -04:00
Zuul	899e534761	Merge "check instance state for instance.update"	2019-06-24 03:54:52 +00:00
Zuul	d046606a7e	Merge "remove tail_log"	2019-06-22 02:31:12 +00:00
Zuul	d98f51c452	Merge "Implement the configuration for Grafana datasource"	2019-06-21 02:52:24 +00:00
Zuul	5effcde632	Merge "Update strategy doc"	2019-06-21 01:37:39 +00:00
licanwei	9b8d1445f1	remove tail_log tail_log is deprecated and should be removed https://github.com/openstack/devstack/blob/master/functions-common#L1611 Change-Id: I6012fd63aa6b0cdb8ed1b278880f8f6c3e4d38cf	2019-06-20 15:28:33 +08:00
licanwei	90291923a1	Update strategy doc Ceilometer removed cpu_util metric in [1]. Another metric compute.node.cpu.percent need to set compute_monitors option to cpu.virt_driver in the nova.conf, we should remind user about these. [1]: https://review.opendev.org/#/c/580709/ Change-Id: I89306ef7c26fa2927945bd4f3ee88b670511d147	2019-06-20 10:57:05 +08:00
Dantali0n	06f8aa712a	Implement the configuration for Grafana datasource This implements the configuration parameters to implement Grafana as a datasource including the influxdb translator Change-Id: If1f27dc01e853c5b24bdb21f1e810f64eaee2e5c Partially-implements: blueprint grafana-proxy-datasource	2019-06-19 15:58:18 +02:00
zhufl	37b11fa404	Fix missing print format This is to add missing print format in placement_helper.py. Change-Id: I23492fd33f3df8c8709f6d6317c936221f1108d4	2019-06-19 11:43:18 +08:00
Zuul	fd04c67ed8	Merge "Map instance to its node"	2019-06-19 02:44:58 +00:00
Zuul	08622c6c80	Merge "typo ceilometer url"	2019-06-19 02:44:57 +00:00
licanwei	e4fc5a08ed	typo ceilometer url Change-Id: I2cb24249ed1e82dd85c0f586532e5a7cbc57f2c0	2019-06-18 19:29:00 +08:00
Zuul	413028e527	Merge "Fix base enable_plugin branch for grenade run"	2019-06-18 07:53:31 +00:00
Zuul	f8fef7d774	Merge "Replace removed exceptions and prevent regression"	2019-06-18 05:04:24 +00:00
Zuul	81fc8ac6a8	Merge "Define a new InstanceNotMapped exception"	2019-06-18 02:32:33 +00:00
Zuul	5d385db8a1	Merge "Cleanup ConfFixture"	2019-06-16 08:32:41 +00:00
Zuul	a1dd90bb74	Merge "Fix property access in test_global_preference* tests"	2019-06-14 20:17:17 +00:00
Dantali0n	15754a14dd	Replace removed exceptions and prevent regression Replaces the NoSuchMetric exception that was replaced. The exception is replaced with MetricNotAvailable and test cases are added to prevent regression. The changes in the exceptions were introduced in: https://review.opendev.org/#/c/658127/ Change-Id: Id0f872e916aaa5dec59ed1ae6c0f653553b3fe46	2019-06-14 22:00:41 +02:00
Zuul	667d2d661a	Merge "Move datasource query_retry into baseclass."	2019-06-14 09:04:15 +00:00
Zuul	b2111baf91	Merge "Backwards compatibility for node parameter"	2019-06-14 07:42:09 +00:00
licanwei	a4d978b893	Define a new InstanceNotMapped exception In get_node_by_instance_uuid, an exception ComputeNodeNotFound will be thrown if can't find a node through instance uuid. But the exception information replaces the node name with instance uuid, which is misleading, so we define a new exception. Closes-Bug: #1832156 Change-Id: Ic6c44ae44da7c3b9a1c20e9b24a036063af266ba	2019-06-14 10:51:20 +08:00
Zuul	6495e42a60	Merge "Optimize NovaHelper.get_compute_node_by_hostname"	2019-06-14 02:31:13 +00:00
Zuul	e4f80b5461	Merge "Optimize hypervisor API calls"	2019-06-14 02:28:51 +00:00
Zuul	2a2f7902bd	Merge "Improve DevStack documentation to support metrics"	2019-06-14 02:28:48 +00:00
Zuul	6502435dc5	Merge "Remove dead code"	2019-06-14 02:28:47 +00:00
Zuul	5f126cffe0	Merge "Add Placement helper"	2019-06-14 02:21:44 +00:00
Dantali0n	584eeefdc8	Move datasource query_retry into baseclass. Moves the query_retry method into the baseclass and makes the query retry and timeout options part of the watcher_datasources config group. This makes the query_retry behavior uniform across all datasources. A new baseclass method named query_retry_reset is added so datasources can define operations to perform when recovering from a query error. Test cases are added to verify the behavior of query_retry. The query_max_retries and query_timeout config parameters are deprecated in the gnocchi_client group and will be removed in a future release. Change-Id: I33e9dc2d1f5ba8f83fcf1488ff583ca5be5529cc	2019-06-13 15:52:53 +02:00
Matt Riedemann	28df60e275	Fix base enable_plugin branch for grenade run We should be starting from stable/stein on the "old" side of grenade runs now. Rather than hard-code the branch, just use the BASE_DEVSTACK_BRANCH variable. Change-Id: I1b0406f870ed0ae5622cfa7421a6cca00d0f891c	2019-06-13 09:51:19 -04:00
licanwei	7281f6184f	Remove dead code get_node_by_instance_uuid will never return None, so the OR condition is dead code. Change-Id: I26c553e1067a3cbeac6c0afe1c4bfdee4d939055	2019-06-13 17:31:49 +08:00
licanwei	79a57f67e6	Map instance to its node When receiving Nova notification instance.create.end, map instance to its node after adding instance to datamodel. Related-Bug: #1832156 Change-Id: I6f39e8d935195c611f668f71590e1d9ff52ced0d	2019-06-13 15:58:28 +08:00
chenming	731d4bfdf2	update contraints url http://lists.openstack.org/pipermail/openstack-discuss/2019-May/006478.html Change-Id: I1929fc98cd728eb0dae66762481880e23cd793c7	2019-06-12 19:11:53 +00:00
Dantali0n	dd119ca1f8	Backwards compatibility for node parameter Adds backwards compatibility for node parameter used by strategies. If the node value is set by the user configuration it will override the value for compute_node which is the value used by the strategies now. This change was introduced in: https://review.opendev.org/#/c/656622/ Resolution discussed in the meeting on the 5th of June 2019 https://eavesdrop.openstack.org/meetings/watcher/2019/watcher.2019-06-05-08.00.log.html Change-Id: Idaea062789a6b169e64f556fecc34cfbaaee5076	2019-06-12 16:23:58 +00:00
chenke	00f20ab1d4	Fix property access in test_global_preference* tests In Python, when we use @property, the method will be decorated by property. When we call method self.strategy.datasource_backend()[1], Actually it did two things: 1. call self.strategy.datasource_backend() 2. according to the method's return value[2], call self._datasource_backend() [1]. https://github.com/openstack/watcher/blob/bd8636f3f/watcher/tests/decision_engine/strategy/strategies/test_base.py#L87 [2]. https://github.com/openstack/watcher/blob/bd8636f3f/watcher/decision_engine/strategy/strategies/base.py#L368 But in this part, we just want it to perform the first step. So we have to use self.strategy.datasource_backend instead of self.strategy.datasource_backend() The reason why the unittest does not report an error is because the returned value is a mock object, and the second step is executed without error, for example: python -m unittest watcher.tests.decision_engine.strategy.strategies.test_base (Pdb) x=self.strategy.datasource_backend (Pdb) type(x) <class 'mock.mock.MagicMock'> (Pdb) x <MagicMock name='DataSourceManager().get_backend()' id='139740418102608'> (Pdb) x() <MagicMock name='DataSourceManager().get_backend()()' id='139740410824976'> (Pdb) self.strategy.datasource_backend() <MagicMock name='DataSourceManager().get_backend()()' id='139740410824976'> To make the tests more robust, the underlying backend function is mocked to be not callable. Co-Authored-By: Matt Riedemann <mriedem.os@gmail.com> Change-Id: I3305d9afe8ed79e1dc3affe02ba067ac06cece42	2019-06-12 12:00:45 -04:00
licanwei	b57feba5e8	Add Placement helper This patch added Placement to Watcher We plan to improve the data model and strategies in the future specs. Change-Id: I7141459eef66557cd5d525b5887bd2a381cdac3f Implements: blueprint support-placement-api	2019-06-12 11:11:13 +08:00
Matt Riedemann	251264b1b6	Cleanup ConfFixture This makes the ConfFixture extend the Config fixture from oslo.config which handles cleanup for us. The module level import_opt calls are also removed since they are no longer needed. Change-Id: I869e89c53284c8da45e0b1293f2d35011f5bfbf9	2019-06-11 20:18:20 -04:00
Zuul	46a36d1ad7	Merge "Fix string formatting"	2019-06-11 09:58:38 +00:00
licanwei	2d4bc095db	Fix string formatting Change-Id: Iaf995355ec542db076683374c6128656bee2ee6f	2019-06-10 17:16:51 +08:00
licanwei	f9e267fa42	check instance state for instance.update In the process of creating an instance, Nova will emit an instance.update notification with 'building' state. This will cause a KeyError exception because this instance isn't in Watcher datamodel. So we should ignore the notification instance.update with 'building' state. Closes-Bug: #1832154 Change-Id: I950eec50d2cee38bd22c47a70ae6f88bbf049080	2019-06-10 16:11:46 +08:00
Zuul	8000dd650f	Merge "add strategy tempest job"	2019-06-06 09:11:19 +00:00
licanwei	c3e0e41fbf	add strategy tempest job Change-Id: I84a68f92fa34cf19487323f2afb89d379b8d80f5	2019-06-06 07:18:53 +00:00
licanwei	7f37f7b92a	Remove apidoc Now there are some errors when running apidoc, actually we don't need apidoc, so remove it. Closes-Bug: #1831515 Change-Id: I3b91a2c05ed62ae7bbd30a29e9db51d0e021410f	2019-06-04 11:34:07 +08:00
Matt Riedemann	374fd2791f	Optimize NovaHelper.get_compute_node_by_hostname The get_compute_node_by_hostname method is given a compute service hostname and then does two queries to find the matching hypervisor (compute node) with details: 1. List hypervisors with details and find the one that matches the given compute service hostname. 2. Using that node, search for hypervisors with the matching hypervisor_hostname. There are two issues here: 1. The first query is inefficient in that it has to list all hypervisors in the deployment to try and match the one with the compute service hostname client side. 2. The second query is a fuzzy match on the server side [1] so even though we have matched on the node we want, get_compute_node_by_name can still return more than one hypervisor which will result in the helper method raising ComputeNodeNotFound. Consider having compute hosts with names compute1, compute10, compute11, compute100, and so on. The fuzzy match on compute1 would return all of those hypervisors. For non-ironic nodes in nova, the compute service host and hypervisor should be 1:1, meaning the hypervisor.service['host'] should be the same as hypervisor.hypervisor_hostname. Knowing this, we can simplify the code to search just on the given compute service hostname and if we get more than one result, it is because of the fuzzy match and we can then do our client-side filtering on the compute service hostname. [1] https://github.com/openstack/nova/blob/d4f58f5eb/nova/db/sqlalchemy/api.py#L676 Change-Id: I84f387982f665d7cc11bffe8ec390cc7e7ed5278	2019-06-03 12:18:54 -04:00
Matt Riedemann	3f76f9cfdb	Optimize hypervisor API calls The nova CDM builder code and notification handling code had some inefficiencies when it came to looking up a hypevisor to get details. The general pattern used before was: 1. get the minimal hypervisor information by hypervisor_hostname 2. make another query to get the hypervisor details by id In the notifications case, it was actually three calls because the first is listing hyprvisors to filter client-side by service host. This change collapses 1 and 2 above into a single API call to get the hypervisor by hypervisor_hostname with details which will include the service (compute) host information which is what get_compute_node_by_id() was being used for. Now that nothing is using get_compute_node_by_id it is removed. There is more work we could do in get_compute_node_by_hostname if the compute API allowed filtering hypervisors by service host so a TODO is left for that. One final thing: the TODO in get_compute_node_by_hostname about there being more than one hypervisor per compute service host for vmware vcenter is not accurate - nova's vcenter driver hasn't supported a host:node 1:M topology like that since the Liberty release [1]. The only in-tree driver in nova that supports 1:M is the ironic baremetal driver, so the comment is updated. [1] Ifc17c5049e3ed29c8dd130339207907b00433960 Depends-On: https://review.opendev.org/661785/ Change-Id: I5e0e88d7b2dd1a69117ab03e0e66851c687606da	2019-06-03 12:18:54 -04:00
zhufl	9c1b83e610	Add missing ws separator between words This is to add missing ws separator between words. Change-Id: Iab23ce2ad081fef18978579594886950b8e2cb01	2019-05-31 14:51:30 +08:00
Zuul	5f94eef027	Merge "Group instance methods together in nova_helper"	2019-05-31 02:58:11 +00:00
Zuul	788f0055c1	Merge "Improve exceptions and logging in ds manager"	2019-05-31 02:57:33 +00:00
Zuul	88fb097539	Merge "Audit API supports new force option"	2019-05-29 10:01:04 +00:00
Zuul	ee3cbe46ef	Merge "Fix test_metric_file_override metric from backend"	2019-05-29 08:13:31 +00:00
Dantali0n	a00daf9f26	Group instance methods together in nova_helper Some of the methods for retrieving data about instances was placed at the bottom of nova_helper instead of being close to the other instance based methods. Change-Id: I68475883529610e514aa82f1881105ab0cf24ec3	2019-05-29 09:02:21 +02:00

1 2 3 4 5 ...

2084 Commits