TheSaffaGeek

My ramblings about all things technical



EMEA #vBrownbag

That’s right, after having spoken to the infamous APAC and US vBrownbag teams at VMworld and via the VMware vExpert communities, Arjan Timmerman and I have thrown our hats into the ring and have asked to borrow the guys’ vBrownbags and show the world that the EMEA VMware gurus are some of the best in the world.

We are looking to start doing them within the next few weeks and need all the VMware talent in EMEA to put their hands up and come present and for everyone to join the calls/webinars and contribute. We are looking to possibly follow two different tracks so as to not repeat what the APAC and US guys have done and so will be looking for people to present on objectives in the VCP5-IaaS and VCP5-DT exam blueprints. You can do any of the objectives so if you want to do the last objective for the IaaS then first come first served/booked.

We are thinking of doing it every fortnight on a Tuesday, starting on the 23rd of October, although if there is a very good amount of interest, which I trust there will be, then we can make it a weekly occurrence. We are currently looking to do it at 7PM GMT to allow everyone in Europe the time to get home. Please spread the word and let’s show the VMware community the talent EMEA has to offer. You can register here for the webinars.

We have created a Google form for people to start putting in their interest and what they want to talk about here.

Gregg and Arjan



VCDX Spotlight: Artur Krzywdzinski

Name: Artur Krzywdzinski

Twitter Handle: @artur_ka

Blog URL: http://vmwaremine.com

Current Employer: GlassHouse Technologies

VCDX #: 77

How did you get into using VMware?

It was the beginning of 2005 when I saw an article on the internet about virtualisation using Windows Virtual PC software. After a few weeks I started looking around for software which supported Novell Netware and Linux as a VM, and I found VMware and VMware Workstation – which I still use in my HomeLab. That’s how I started. The next step was a small implementation of VI 3.5 (2 nodes + vCenter) – this was a big deal for me at that time :).

What made you decide to do the VCDX?

I got my first VMware certification, VCP3, in 2007 and the next step was VCDX. As my professional goal was always to become an IT Architect, I thought that having the VCDX accreditation would help me to achieve that goal.

How long did it take you to complete the whole VCDX journey?

I started my VCDX journey when I was working at IBM (2008 – 2011 Brno, CZ). During that time I met Stephane Bourdeaud (IBM Architect), who was an account architect for one of the IBM customers that my team in Brno and I supported. Thanks to Stephane I started doing VMware designs for IBM customers, and we spent tens of hours on the phone talking about my designs. Thank you Stephane one more time.

I finished my journey in May 2012 at the VCDX defence panel in Toronto.

What advice would you give to people thinking of pursuing the VCDX accreditation?

· Take your time and get experience in technology and design.

· Make sure that you start your VCDX design project at least a few months before the submission deadline – a good design is 50% of success.

· Make sure you send your design to peers for review – this is a must if you want to have your design accepted.

· Before the VCDX defence panel, do a few mock defences with friends who know the technology.

· On defence itself, don’t get stressed, panellists are very friendly and very helpful.

If you could do the whole VCDX journey again what would you do differently?

From a project, preparation for defence or defence itself perspective – I would do it exactly the same way as I did. Why? Because I passed on the first attempt. What I would do differently is that before starting a VCDX project I would find a company interested in having a VCDX on board and in sponsoring the whole project :). It is quite an expensive accreditation, especially if you have to pay for everything (defence fee, travel, hotel) yourself, as I had to.

Life after the VCDX?  How did your company respond?  Was it worth it?

Not too enthusiastic, to be honest. Nobody even noticed I got VCDX.

Was it worth it? Hard to give a straight answer, as there are two aspects. First – community: VCDX accreditation is seen as a very big achievement which not everybody can reach. Second is business – unfortunately VCDX is barely recognized by IT managers, directors and head-hunters, and demand for VCDX certified professionals is very, very low. But I’m still hoping it was worth it.



VCDX Spotlight: Shane White

Name: Shane White

Twitter Handle: @ausvmguy

Blog URL: none (yet)

Current Employer: Southern Cross Computer Systems

VCDX #: 95

How did you get into using VMware?

In 2007, I started playing with Workstation. In the 2nd half of that year, I completed a training program I had been on for a while. I had seen ESX 2.5 installed but didn’t know a lot about it. When I asked my employer what the options were for continued training and/or specialisation and virtualisation was an option, I elected to go down this road. I got my VCP3 in October of that year and had the opportunity to do nothing but VMware onsite for the next 3 years or so with one of our clients.

What made you decide to do the VCDX?

I have always liked a challenge. With the exposure and skills I obtained onsite, and the satisfaction I got from working with VMware technologies, I decided that attempting VCDX would be enjoyable, definitely challenging, but not unachievable.

How long did it take you to complete the whole VCDX journey?

If you count from the time I decided to go for it, just under 2 years. Due to my commitments onsite, I couldn’t spend large blocks of time on my design. I achieved both VCAP4 exams in Nov 2010 (4 days apart!). I started working up the design in late November 2010 and had my 1st defence attempt in Singapore in November 2011, which was unsuccessful. The feedback received from this defense and from a 2nd unsuccessful defense in Toronto in May 2012 was applied to a revised submission aimed for defense at VMworld in San Francisco in August 2012. On the 3rd attempt, I was successful!

What advice would you give to people thinking of pursuing the VCDX accreditation?

I found it particularly valuable to have had practical experience working with a real environment. While a home lab is very useful as well, a real environment will encounter real design decisions and configuration issues that may not arise in a home environment.

Consider carefully the kind of design you are planning on submitting. My design was a totally fictitious one. My success shows that you can defend a fictitious design, but it is likely to be harder than if a real design is used.

Either way, read the blueprint and make sure you address ALL the criteria and that you know, not only what your design decisions are, but also understand why that choice was the most appropriate in your situation. This means having a good grasp on the business requirements that have a bearing on those decisions.

Don’t get discouraged if your 1st (or subsequent) attempts are unsuccessful. View it as an opportunity to improve. Achieving VCDX has been described as a journey, and the entire journey can be beneficial and bring satisfaction if you let it.

If you could do the whole VCDX journey again what would you do differently?

I’d read the blueprint more closely and more often! Apart from that, I wouldn’t do anything else differently. I enjoyed the whole experience and found the whole process immensely educational.

Life after the VCDX?  How did your company respond?  Was it worth it?

SCCS was definitely pleased with the successful defense.

As for me, the sense of satisfaction of achieving something significant is immense. The increased confidence when facing a situation is also noticeable.

Was it worth it? Absolutely. Even if I had decided not to reattempt, the process of developing the design, all the researching, and preparing for the defense and developing the presentation was of great benefit.



VCDX Spotlight: Sachin Bhowan

Name: Sachin Bhowan

Twitter Handle: @sbhowan

Current Employer: VMXperts

VCDX #: 38

How did you get into using VMware?

My first use of VMware was back in 2004 when we were investigating bare metal recoveries for Tivoli Storage Manager (TSM) and were experimenting with WinPE and GSX. I was looking for a way to standardize the hardware platform for recovery, as at the time there was no bare metal recovery option for TSM. It was after that that I started exploring and testing the ESX platform for server workloads and, with the onset of Version 3, as the cliché goes… the rest became history!

What made you decide to do the VCDX?

An interesting question in that there are two reasons for this; the main reason was for the challenge to prove to myself that the solutions I was actively “preaching” and delivering on were on par and on the right track. The second reason was simply being at the right place at the right time as we were having a meeting with our VMware Partner Manager and our CEO was complaining about the lack of elitism within the VMware certification portfolio and then they announced the VCDX program. I was then “volunteered” to make this happen!

How long did it take you to complete the whole VCDX journey?

I started this journey in September 2008 and tried to track and get as much information as possible, however being in the geo that I was, the most helpful information I got was from the active blogs run by Duncan Epping, Rick Scherer, VCDX001, and twitter tips, as exam information and details were only given out after successful registrations. The process was also impacted by the fact that I had to take all my exams in Europe or the US as there was no authorized testing centre in South Africa at that time. Therefore I had to wait for exams to open as well as align them with the major VMware events to reduce costs. This also forced me to be prepared at very short notice when I was given the availability of the exams! That said I completed my defence in February 2010 giving a total time of about 18 months.

What advice would you give to people thinking of pursuing the VCDX accreditation?

My advice is to be passionate about what you are doing, and if you feel that doing the research and working out the solution interdependencies and limitations is mundane and tedious then this might not be the right track for you. Working towards the VCDX in my experience involved a lot of patience, dedication, passion and, not to mention, discipline, as it means working studiously and diligently on an everyday basis because not all solution requirements are the same. It also means investing a lot of your time over and above your daily responsibilities and duties (work and family included); so commit for the long haul.

If you could do the whole VCDX journey again what would you do differently?

I do not think I would change much aside from the fact that with exams now available locally it would have meant a shorter timespan to get the accreditation.

Life after the VCDX?  How did your company respond?  Was it worth it?

Life for me has pretty much been the same since I achieved the VCDX accreditation and simply so because I chose for it to be that way as there were some tempting offers and I opted out for personal reasons. However I am now responsible for heading up a new business called VMXperts, that is a subsidiary of my former company Aptronics. (You can guess what platform this company’s focus will be). As for the response from my company everyone was simply elated and it also went well that I was the very first person in Africa to achieve this. This made their investment in the VCDX program a worthy one!



VCDX Spotlight: Wade Holmes

Name: Wade Holmes

Twitter Handle: @wholmes

Blog URL: www.vwade.com

Current Employer: VMware

VCDX #: 15

How did you get into using VMware?

The year was 2004. I was an IT Specialist working in IBM’s Business Continuity and Resiliency Services, and became aware of customers utilizing VMware for backup and recovery of their datacenters. I went to my manager at the time and told him about this trend, and that I was interested in becoming a VMware SME for IBM BCRS. I started working with ESX 2.0 and VirtualCenter 1.0, attended VMware training, and in 2005 became a VMware Certified Professional. During this time I spearheaded the creation and rollout of IBM BCRS’s first VMware based warm-site disaster recovery offering across the US, reducing the RTO of numerous Fortune 500 clients. And so began the journey towards VMware excellence!

What made you decide to do the VCDX?

It was early 2008, and the VCDX certification was announced. The rigor of the requirements immediately attracted me to pursuing the certification. I knew this was a certification that could help further my career as an IT professional.

How long did it take you to complete the whole VCDX journey?

I completed the VCDX in the summer of 2009, after taking the beta exam and defending during the first publicly available defense. Below is the path I took before defending.

VCP on VI3
Enterprise Exam (beta)
Design Exam (beta)

I had no idea what to expect when coming to defend, and was extremely nervous. I spent countless hours preparing, reviewing my design, making sure I knew the ins and outs, and could justify every granular detail I documented. Luckily, that was exactly the approach necessary for me to be successful. I can’t describe how happy I was when I got a phone call that I passed and was a VCDX! (Yes, back then I was actually contacted by phone to be informed I passed.) In becoming VCDX #15, I was the first non-VMware employee worldwide to achieve the certification (as I worked for a partner at the time).

What advice would you give to people thinking of pursuing the VCDX accreditation?

Dive in head first to master your craft. Understand not just the what, but more importantly the why of architecture and design. Use the plethora of resources available to you online to become familiar with the format. Sign up for a VCDX Bootcamp to help prepare.

If you could do the whole VCDX journey again what would you do differently?

Nothing except more sleep the night before the defense (if you can sleep).

Life after the VCDX?  How did your company respond?  Was it worth it?

I believe I have a unique perspective on the VCDX program, having completed the VCDX program in 2009 as the first non-VMware VCDX worldwide, and then joining VMware and participating in the ongoing development of the program as a panelist. As an outsider looking in, the VCDX program was a goal that drove me to work on my craft, and become a better architect.

Since joining VMware, my participation in the VCDX program has only helped to hone my skills as a virtualization and cloud architect. It has forced me to sharpen my understanding of enterprise architecture principles, principles that aid me greatly in my day-to-day role dealing with virtualization and cloud solutions. I will be forever grateful to the VCDX program for providing a vehicle that forced me to push myself, and for helping me take my career to another level.



VCDX Spotlight: Randy Stanley

Name: Randy Stanley

Twitter Handle: @randystanley

Blog URL: http://www.randystanley.com

Current Employer: IT Partners

VCDX #: 94

How did you get into using VMware?

In 2003 I was working for a small software development company managing their business applications and supporting their software development team. Initially we began utilizing VMware GSX Server for those simple use cases, trying to consolidate and save on our hardware spend wherever we could. In support of the software development team we also deployed ESX in a lab environment for testing and development purposes only. A fairly common introduction and use case early on in the adoption of VMware solutions. Plus, vMotion was the coolest freakin’ thing I had ever seen.

It wasn’t until I re-entered the consulting field in 2007 that I really started to dive deep into the VMware products and they have been an integral part of every solution we sell and deploy. It was this exposure to the VMware technology that really allowed me to develop my abilities and deepen my experience. I also should say that a large draw for me was the large, friendly and helpful community that supported and shared knowledge around the VMware products; easily the best community with which to be associated.

What made you decide to do the VCDX?

For me the decision was twofold: first, because I’ve had the great fortune of working with one of the best consultants I know in Doug Baer, VCDX #19, and second, for the sheer challenge of obtaining the certification. A natural, underlying part of the equation has always been my love of the technology and interest in understanding how it works at its core. In my current line of work, utilizing the skills and knowledge measured by the VCDX certification is highly relevant and in many ways a validation of those abilities.

How long did it take you to complete the whole VCDX journey?

It’s hard to say exactly when the journey started, as I had wanted to go after it for the last couple of years, but it seemed so far off and I never really got going. In May 2011, I started and then stopped my journey with a failed attempt on the required VCAP-DCA exam which in combination with a heavy load of customer commitments limited my ability to focus on it. Since I wasn’t accustomed to failing an exam, the DCA failure caught me off guard and I needed to regroup. It was then about 6 months later over the 2011 Thanksgiving (US) holiday that I had a little heart-to-heart with myself and decided regardless of the time, effort or success, I was going to go after the VCDX4 before it was updated to version 5. I was leaving too many good designs on the table which I had worked on with vSphere 4 to not try to at least defend one of them. That’s when my real, 6-month journey toward VCDX began. This involved the DCD4 exam in December, the DCA4 exam in January, the VCP5 upgrade and the DCD5 beta in February, the VCDX4 Design and application in March and then the VCDX4 Defense in May. Approximately 6-months start to finish, but ultimately the journey never ends or at least I hope it doesn’t.

What advice would you give to people thinking of pursuing the VCDX accreditation?

My advice to those interested in the VCDX would be to dedicate themselves to the investment of time and resources necessary in the effort. This may mean the setup of a home lab, the time to read product guides, the repetition of product implementation and design, and/or the review of countless blogs and knowledge base articles. But beyond having a sound technical and architectural knowledge it will also require comfort in the spotlight, an ability to present from a white board, a quickness to think on your feet, an ability to envision the big picture design, and an openness to feedback, critique and improvement. With all that said, bottom line for anyone seriously considering it, I would say go for it. You’ll never know what could have been if you don’t try. I believe many will be surprised by what they can accomplish when they focus on a goal like the VCDX.

If you could do the whole VCDX journey again what would you do differently?

I probably would have started it earlier. Overall I felt the execution was successful once I got going, but for me it was just the issue of starting and sticking with it. Beyond that I don’t think I would have changed much.

Life after the VCDX?  How did your company respond?  Was it worth it?

In my consulting position, the certifications are very much a part of the role and needed by the company to market, sell and deliver the solutions that we focus on. The certification definitely brought some recognition and accolades. It also provided some instant credibility amongst those in our community. For the most part, I do believe it was worth it mainly because of the challenge it provided to me and the opportunity to do what I love most which is work with the technology, understand the architecture of the products, solve the business problems of my customers, and participate in a community that is passionate about all these same things.



VCDX Spotlight: Rick Scherer

Name: Rick Scherer

Twitter Handle: @rick_vmwaretips

Blog URL: http://www.vmwaretips.com

Current Employer: EMC

VCDX #: 21

How did you get into using VMware?

I first heard of and started using VMware Workstation in late 1999. I was a UNIX Administrator that was forced to live in a corporate Windows world. Workstation allowed me to have the best of both worlds. After thorough use and testing of the GSX and ESX products, by 2003 I was able to convince my then employer’s management that virtualization was a must for our datacenter. The rest is history.

What made you decide to do the VCDX?

Since day one I’ve been convinced that virtualization was a huge benefit for organizations large and small. Since obtaining my first VCP (VCP2 #7315) in 2006, I’ve been following the work of the education team closely. When I was invited to be a beta participant in the then newly created VCDX program I saw it as an opportunity to validate my dedication, knowledge and experience. By making a candidate jump through as many hurdles as the VCDX program has, it really shows that VMware users are dedicated to the cause. This is a great way to give VMware additional validation to the industry.

How long did it take you to complete the whole VCDX journey?

From start to finish, an extremely long year. It was extremely rewarding being part of the beta program process though. Being able to assist in shaping the program and how the Design and Administration exams were written was really fun.

What advice would you give to people thinking of pursuing the VCDX accreditation?

Know your stuff, inside and out. The VCDX is really a mixture of everything, knowing not only how to fully design and architect a virtualized infrastructure (not only from a VMware perspective but the associated compute, network and storage), but also how to implement that design, manage that design, upgrade that design and operate that design.

Get as much hands on as possible. Learn as much as possible about how compute, network and storage relate to virtualization. Learn how applications relate to virtual machines. Know your design. That’s probably the most important thing, for your defense do not design your dream architecture, keep it simple and keep it to something you’ve done before. Know it inside and out, know what failed and how you fixed it. Don’t say you designed a specific thing to meet Best Practices, know why it’s the best practice.

Also, if you’re married… get your spouse’s buy-in on the journey as well. You’re going to spend a lot of time away from them while you’re on the journey and you’ll need more support than you’ve ever needed before.

If you could do the whole VCDX journey again what would you do differently?

No regrets, I loved every single part of the process. I wish I didn’t rush through it as fast as I possibly did, but I was so excited to be part of something new, something fresh and something fun! How awesome is it to be VCDX #21! :)

Life after the VCDX?  How did your company respond?  Was it worth it?

I think obtaining the VCDX certification opened a lot of new doors and opportunities. Since obtaining my VCDX I’ve joined EMC as part of their vSpecialist organization, here I’m able to put my knowledge and experience directly to use as I evangelize companies about all of the amazing benefits of virtualization, application modernization, end-user computing and now cloud computing.



VMware vSphere Data Protection

vSphere Data Protection (VDP) is a robust, simple-to-deploy, disk-based backup and recovery solution. VDP is fully integrated with VMware vCenter Server and enables centralized and efficient management of backup jobs while storing backups in de-duplicated destination storage.

Benefits:

•VDP leverages VMware vSphere API for Data Protection (VADP) which includes Changed Block Tracking (CBT) along with the EMC Avamar variable-length segment de-duplication engine to optimize backup and recovery times. Initial backups take a fair amount of time, but subsequent backups can be as little as a few minutes depending on the number of changes that have occurred since the last backup.

•Backup agents are not needed as VDP leverages VADP. VMs are backed up to disk-based storage (.vmdk files attached to the VDP virtual appliance).

•De-duplication occurs not only within each VM, but across all backups jobs and all VMs being backed up by the VDP appliance.

•A VM that utilizes an agent for backup and recovery must be in a powered-on state. With VDP, that is not the case – backups and recoveries can be performed regardless of the VM’s power state.

•There is no need to install backup management software on an administrator’s workstation. Configuration and management of VDP is web browser based. Currently supported browsers: IE 7, 8 on Windows. Firefox 3.6 and higher on Windows or Linux. Adobe Flash is required.

•Restores can be entire VM or individual files and folders/directories. The file-level restore user interface (UI) is web based, simple, and intuitive meaning end-users can perform self-service file-level restores (administrator permissions required).

•Deployment, configuration and management of VDP is done via a web browser based graphical user interface (GUI). The majority of configuration tasks are completed using intuitive wizard-driven workflows.
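
The cross-VM de-duplication described in the list above can be pictured with a small content-addressed store. This is a toy sketch only: it uses fixed-size segments and SHA-256 for simplicity, whereas the Avamar engine mentioned above uses variable-length segments, and the names `DedupStore`, `backup` and `restore` are hypothetical rather than part of any VDP interface.

```python
import hashlib


class DedupStore:
    """Toy content-addressed store: each unique segment is kept only once."""

    def __init__(self, segment_size=4):
        self.segment_size = segment_size
        self.segments = {}  # digest -> segment bytes (stored once)
        self.backups = {}   # backup name -> ordered list of digests

    def backup(self, name, data):
        """Store data under name and return how many new segments were written."""
        new_segments = 0
        recipe = []
        for start in range(0, len(data), self.segment_size):
            segment = data[start:start + self.segment_size]
            digest = hashlib.sha256(segment).hexdigest()
            if digest not in self.segments:
                self.segments[digest] = segment
                new_segments += 1
            recipe.append(digest)
        self.backups[name] = recipe
        return new_segments

    def restore(self, name):
        """Rebuild the original bytes from the stored segment recipe."""
        return b"".join(self.segments[d] for d in self.backups[name])


store = DedupStore(segment_size=4)
vm1_new = store.backup("vm1", b"AAAABBBBCCCC")
vm2_new = store.backup("vm2", b"AAAABBBBDDDD")  # shares two segments with vm1
print(vm1_new, vm2_new, len(store.segments))
```

In this sketch the second VM only adds the one segment that the store has not seen before, which is the effect the appliance-wide de-duplication is aiming for.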

vSphere Data Protection Key Components

VDP VM Appliance

•VDP is a virtual machine appliance deployed from a .ova (open virtual appliance or application) file.

vSphere Infrastructure

•vSphere API for Data Protection (VADP) is utilized by VDP. This includes the Changed Block Tracking (CBT) mechanism. CBT tracks the changes made to a VM at the block level and provides this information to VDP so that only changed blocks are backed up. This significantly reduces storage consumption and speeds up backup and recovery times with VDP.

•VMware Tools on Windows contains Volume Shadow Copy Service (VSS) components to assist with guest OS and application quiescing when backing up Windows VMs. More details on VSS can be found here: http://technet.microsoft.com/en-us/library/ee923636(v=WS.10).aspx
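
The idea behind Changed Block Tracking can be sketched in a few lines. This is a simplified model under stated assumptions, not the VADP interface: the disk is a list of blocks, the tracker is a plain set of block numbers, and `TrackedDisk` with its methods is a hypothetical name chosen for illustration.

```python
class TrackedDisk:
    """Toy virtual disk that records which blocks changed since the last backup."""

    def __init__(self, num_blocks):
        self.blocks = [b"\x00"] * num_blocks
        # Nothing has been backed up yet, so every block counts as changed.
        self.changed = set(range(num_blocks))

    def write(self, index, data):
        """Write one block and remember that it changed."""
        self.blocks[index] = data
        self.changed.add(index)

    def backup(self, target):
        """Copy only the changed blocks into target (a dict); return their numbers."""
        copied = sorted(self.changed)
        for index in copied:
            target[index] = self.blocks[index]
        self.changed.clear()
        return copied


disk = TrackedDisk(num_blocks=8)
backup_copy = {}
first = disk.backup(backup_copy)   # initial backup copies all 8 blocks
disk.write(2, b"x")
disk.write(5, b"y")
second = disk.backup(backup_copy)  # later backup copies only blocks 2 and 5
print(len(first), second)
```

The first run has to copy everything, while the second run touches only the two blocks written in between, which mirrors why initial backups are slow and subsequent ones are quick.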

VDP Architecture

•The appliance is deployed by default with 4 vCPUs and 4 GB RAM.

•Available in three sizes: 0.5 TB, 1 TB, and 2 TB – these are usable destination datastore sizes. The actual amount of disk space (thick provisioned) consumed by the appliance is 850 GB (3 .vmdk files), 1600 GB (7 .vmdk files), and 3100 GB (13 .vmdk files) respectively. Thin provisioning can be used, but the administrator should closely monitor disk consumption. It is important to note that once the VDP appliance is deployed, the size cannot be changed.

•The VDP appliance guest OS is SuSE Linux 11.

•vCenter Server 5.1 is required to use VDP. VDP can back up VMs on hosts running vSphere 4.0 and higher.

•VDP management is done via the vSphere Web Client. There is no plug-in for the vCenter Server “thick” client.


vSphere 5.1 Announced with Enhanced vSphere Replication

vSphere Replication

vSphere Replication (VR) is the industry’s first and only genuinely hypervisor-level replication engine.

It is a feature first introduced with Site Recovery Manager 5.0 to allow for the vSphere platform to protect virtual machines natively by copying their disk files to another location where they are ready to be recovered.

VR is a software based replication engine that works at the host level rather than the array level.

Identical hardware is not required between sites, and in fact customers can run their VMs on any type of storage they choose at their site – even local storage on the vSphere hosts, and VR will still work.

It provides simple and cost-efficient replication of applications to a failover site.

VR is a component delivered with vSphere editions of Essentials Plus and above, and also comes bundled with Site Recovery Manager. This offers protection and simple recoverability to the vast majority of VMware customers without extra cost.

•With VR, a virtual machine is replicated by components of the hypervisor, removing any dependency on the underlying storage, and without the need for storage-level replication.

•VMs can be replicated between *any* type of storage platform: Replicate between VMFS and NFS, from iSCSI to local disk. Because VR works above the storage layer it can replicate independently of the file systems. (It will not, however, work with physical RDMs.)

•Replication is controlled as a property of the VM itself and its VMDKs, eliminating the need to configure storage any differently or to impose constraints on storage layout or management. If the VM is changed or migrated then the policy for replication will follow the VM.

•VR creates a “shadow VM” at the recovery side, then populates the VM’s data through replication of changed data.

•While VR can be deployed through the “thick client” all management and interaction with VR is done strictly through the vCenter 5.1 web interface.

•Only vSphere 5.0 and 5.1 will work for vSphere Replication as the VR Agent is a component of the vSphere 5.x hypervisor.

•vSphere Replication can not co-exist with the vSphere Replication pieces originally shipped with SRM 5.0. If an existing SRM 5.0 vSphere Replication environment is in place it will need to be uninstalled and replaced with the standalone vSphere Replication from vSphere 5.1.

•While both Storage DRS and Storage vMotion are supported, they introduce certain scenarios to be aware of:

•While Storage vMotion of a VR protected VM can be done by an administrator, on vSphere 5.0 this may create a “full sync” scenario in which a VM must be completely resynchronized between source and destination, possibly violating the configured recovery point objective for that VM.

•Storage DRS compounds this problem by automating storage vMotion, and thereby may potentially cause the protected virtual machines to create continual full sync scenarios, driving up I/O on the storage, thereby creating cyclical storage DRS events. Because of this it is unsupported with 5.0.

•Storage vMotion and SDRS are only able to be run on the *protected* VM and can not execute against the *replica* of the VM.

•When using vSphere Replication with Site Recovery Manager, storage vMotion and storage DRS are *not supported*

•Neither of these scenarios is true with vSphere 5.1 as the persistent state file that contains current replication data is migrated along with the rest of the VM, which did not occur in vSphere 5.0.
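
The difference between the 5.0 and 5.1 behaviour described in the last few bullets can be pictured with a toy model. It assumes, purely for illustration, that the replication state is a set of changed block numbers and that a migration either carries that state along or drops it; `ReplicatedVM`, `migrate` and `sync` are hypothetical names and say nothing about how VR is actually implemented.

```python
class ReplicatedVM:
    """Toy model of a replicated VM and its persistent replication state."""

    def __init__(self, num_blocks):
        self.num_blocks = num_blocks
        self.dirty = set()       # blocks changed since the last sync
        self.state_valid = True  # False once the replication state is lost

    def write(self, block):
        self.dirty.add(block)

    def migrate(self, carry_state):
        """Move the VM to another datastore, with or without its state."""
        if not carry_state:
            self.dirty.clear()
            self.state_valid = False

    def sync(self):
        """Return the blocks that must be examined for the next replication."""
        if self.state_valid:
            to_check = sorted(self.dirty)            # delta sync
        else:
            to_check = list(range(self.num_blocks))  # full sync
        self.dirty.clear()
        self.state_valid = True
        return to_check


# State travels with the VM (the 5.1 behaviour described above).
vm_with_state = ReplicatedVM(num_blocks=100)
vm_with_state.write(7)
vm_with_state.migrate(carry_state=True)
delta = vm_with_state.sync()

# State is left behind (the 5.0 behaviour described above).
vm_without_state = ReplicatedVM(num_blocks=100)
vm_without_state.write(7)
vm_without_state.migrate(carry_state=False)
full = vm_without_state.sync()

print(len(delta), len(full))
```

In the model, losing the state turns a one-block delta into a pass over the whole disk, which is why repeated automated migrations could put the recovery point objective at risk.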

vSphere Replication is not “new” as it has more than a year-long track record of success with Site Recovery Manager.

VR is a non-disruptive technology: It does not use vSphere file-system snapshots nor impact the execution of the VM in any abnormal way.

Since VR tracks changes at a sub-VM level, but above the file system, it is completely transparent to the VM unless Microsoft Volume Shadow Copy Service (VSS) is being used to make the VM quiescent. Even then VR uses fully standard VSS calls to the Microsoft operating system.

Virtual machines can be replicated irrespective of underlying storage type

• Can use local disk, SAN, NFS, and VSA

• Enables replication between heterogeneous datastores

• Replication is managed as a property of a virtual machine

• Efficient replication minimizes impact on VM workloads

vSphere Replication Use Cases

Protecting VMs within a site, between sites, or to and from remote and branch offices.

Can use dissimilar storage, low cost NAS Appliances, even independent vSphere hosts with only local disk.

VR Deployment

VR is deployed via a standard virtual appliance OVF format.

The OVF contains all the necessary components for VR.

•What used to be both the “VRMS and VRS” in the SRM 5.0 implementation of VR are included in the “VR Appliance” now

•This allows a single appliance to act in both a VR management capacity and as the recipient of changed blocks

•Scaling sites is an easy task: simply deploy another VR Appliance at the target site and it will contain the necessary pieces to either pair and manage replication for a site or simply receive changed blocks as per the VRS

vSphere Replication Limitations

vSphere Replication is targeted at replicating the virtual disks of powered on virtual machines only. It is based on a disk filter to track changes that pass through it, therefore static images can not be tracked.

Powered-off or suspended VMs will not be replicated. The assumption is that if the VM is important enough for protection, it is powered on.

That also means non-disks attached to a VM (ISOs, floppy images, etc) are not replicated. Also any disks, ISOs, or configuration files not associated with a VM will not be replicated.

Moreover, files that are not required for the VM to restart (e.g., vswp files or log files) are not replicated by VR.

Since VR works above the disk itself at the virtual device layer, it can be completely independent of specifics about the VMDK it is replicating. VR can replicate to a different format than its primary disk – i.e. you can replicate a thick provisioned disk to be a thin provisioned replica.

VM snapshots in and of themselves are not replicated but instead are collapsed during replication. A VM with snapshots may be configured for protection by VR (and you can take and revert snapshots), but the remote state for such VMs will be “flat” without any snapshots. Snapshots are aggregated into a single VMDK at the recovery location.

Note: Reverting from a snapshot may cause a full sync!

VMs can be replicated with a recovery point objective (RPO) of at least 15 minutes and at most 24 hours. This means that a recovery of a replicated VM may lose up to 15 minutes' worth of recent data even at the lowest RPO setting.

How it works

Fundamentally VR is a handful of virtual appliances that allow the vSphere kernel to identify and replicate changed blocks between sites. The configuration and deployment is a handful of simple steps.

Once the administrator has deployed the components it is a matter of pairing a source and destination.

Lastly, configuration of an individual VM for protection tells VR to start replicating its changes, and where to put them at the recovery location.

Only replicates changed blocks

On an ongoing basis, after the first sync, VR will only ship changed blocks.

Within the RPO defined by the administrator, VR tracks which blocks are being dirtied and will create a “lightweight delta” (LWD) bundle of data to be transferred to the remote site.

Pointers to changed blocks are kept in both a memory bitmap and a “persistent state file” (PSF) located in the directory of a VM. Memory contents are always current, while the PSF represents the LWD currently being shipped. After an LWD is shipped and completely acknowledged, the memory bitmap is copied to the PSF and the memory bitmap is restarted for the next LWD.
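The interplay between the memory bitmap and the PSF can be sketched in a few lines of Python. This is only a toy model for illustration: the class and method names are hypothetical, sets stand in for the real bitmap and on-disk file, and nothing here reflects the actual VR implementation.

```python
class DirtyBlockTracker:
    """Toy model of the memory bitmap / persistent state file (PSF) pair."""

    def __init__(self):
        # Always-current record of blocks dirtied since the last rollover.
        self.memory_bitmap = set()
        # Stand-in for the PSF: the blocks of the LWD currently being shipped.
        self.psf = set()

    def record_write(self, block_index):
        """Called for every guest write that passes through the filter."""
        self.memory_bitmap.add(block_index)

    def on_lwd_acknowledged(self):
        """The shipped LWD was fully acknowledged: roll the bitmap over.

        The memory bitmap is copied to the PSF and then restarted,
        as described in the text above.
        """
        self.psf = set(self.memory_bitmap)
        self.memory_bitmap.clear()
        return sorted(self.psf)


tracker = DirtyBlockTracker()
for block in (3, 7, 3):
    tracker.record_write(block)
next_lwd_blocks = tracker.on_lwd_acknowledged()
tracker.record_write(9)  # lands in the fresh bitmap, not in the PSF
```

Writing the same block twice costs nothing extra, since only the pointer is tracked and the block is shipped once per LWD.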

VR will use the defined RPO to determine how often to create a LWD. Time must be allowed to create the block bundle, transfer it, and successfully complete writing the entire bundle to ensure that the RPO is not violated. In order to do this, VR will track the length of the previous 15 transfers to create an estimate of how long it will take to complete the transaction of the subsequent LWD.

For example, if a transfer takes 1 minute to create, 8 minutes to transfer, and 1 minute to write, by the time the data is successfully written the replicated data is already 10 minutes old. With, for example, a 1 hour RPO set for a VM, the next transfer would need to start within the next 40 minutes at the latest. This presumes 10 minute old data plus the next 10 minute transfer = 20 minutes gone out of the 1 hour RPO to ensure the data at the recovery site is never older than the RPO defined.

If a transfer of a LWD takes more than half the time of the RPO it is very likely that the RPO will be violated based on the incremental “catch up” to the RPO period and it will be flagged as a potential RPO violation.
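The arithmetic above can be written out as a small Python sketch. The function names are hypothetical, and using a plain average of the last 15 transfers as the estimate is an assumption made here for illustration; the text only says that the previous 15 transfers are tracked, not how the estimate is derived from them.

```python
from statistics import mean

HISTORY_LENGTH = 15  # the text says the previous 15 transfers are tracked


def estimate_transfer_minutes(history):
    """Estimate the next LWD duration (assumption: a simple average)."""
    return mean(history[-HISTORY_LENGTH:])


def latest_next_start(rpo_minutes, last_transfer_minutes, estimated_next_minutes):
    """Minutes after the last transfer completes by which the next must start.

    The replica is already `last_transfer_minutes` old when it lands, and the
    next LWD needs `estimated_next_minutes` to create, ship and write.
    """
    return rpo_minutes - last_transfer_minutes - estimated_next_minutes


def is_potential_rpo_violation(rpo_minutes, transfer_minutes):
    """Flag transfers that take more than half of the RPO."""
    return transfer_minutes > rpo_minutes / 2


# Worked example from the text: 1 + 8 + 1 = 10 minute transfers, 1 hour RPO.
window = latest_next_start(60, 10, 10)  # 40 minutes
```

With a 15 minute RPO the same 10 minute transfer would be flagged, since it takes more than half of the RPO.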

VR will create a per-host replication schedule by taking into account *all* the VMs being replicated from that particular host. This allows it to do host-wide scheduling for each replicated VMDK and allows transfers to take place according to variables such as length of transfer, size of LWD, etc. and gives the scheduler flexibility to send data when appropriate.

The scheduler will execute each time an event occurs that alters replication patterns, such as a power task on a replicated VM, changes to RPOs or a full sync, or an HA event such as a host crash.

Only the most-recent transfer information is persisted. If hostd crashes, or the VM is migrated, or reconfigured, the historic transfer state is lost, and must be re-accumulated for the scheduler to be most effective.

It is important to note that VR is *not* using vSphere based snapshots to create redo logs of the primary VMDK. The VMDK is not interrupted in any fashion at all, and there is no snapshot created.

It also does NOT use “CBT” or “Changed Block Tracking”, another feature of the vSphere Platform. The vSCSI filter of VR is completely independent of CBT by design. This allows CBT to remain untouched for other tools such as VADP and backup software. If CBT were to be used it would reset the changed block tracking epoch, breaking backups and other uses of CBT.

VR is 100% isolated from snapshots and CBT.

Recovering a VM with a few clicks

A VM can be recovered only if the original is either powered off or not reachable by the recovery vCenter Server. This is to avoid having duplicate VMs running at the same time.

For further safety, the VM is booted with no networks connected to help avoid duplicate VMs colliding.

Once the recovery is processed, you can not reconnect and re-enable replication of that VM. You must re-start protection all over again. You may, however, use the old VMDK that might remain at either site as a seed to begin replication again.

Four steps for full recovery

As long as the replication has completed at least once a VM can be recovered quickly and easily directly from the vCenter Web Client.

From the Replication location in the Web Client, choose a VM that has been replicated, right-click and choose to recover.

Choosing a target folder and resource (cluster, host, or resource pool) will then instantiate the replicated VM, create and register the vmx, attach the VMDK and power on the VM if chosen.

This can not be automated, and can only be done for a single VM at a time.



vSphere 5.1 Announced with Site Recovery Manager 5.1

With the announcement of vSphere 5.1 also comes the announcement of Site Recovery Manager 5.1. Below are some of the new features and enhancements coming with SRM 5.1.

Application Quiescence for vSphere Replication

The new VR has improved VSS integration and doesn’t merely request OS quiescence, but flushes app/db writers if present.

This is due to better handling of VSS through the VMware Tools present in vSphere 5.1 and requires no work to configure – merely select the quiescing method and VR will handle it.

If VR is asked to use VSS, it will synchronize its creation of the lightweight delta with the request to flush writers and quiesce the application and operating system. This ensures full app consistency for backups.

vSphere Replication is presented with the quiescent and consistent volume produced by the OS flushing the VSS writers, and that consistent volume is used to create the LWD for replication.

If for some reason the VSS can not quiesce correctly or flush the writers, VR will continue irrespective of the failure and create an OS consistent LWD bundle at the VM level, and generate a warning that VSS consistency was not able to be created.
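The fallback behaviour described above can be sketched as follows. This is a minimal illustration in Python, not VR code: `QuiesceError`, `build_lwd` and the two callables passed in are hypothetical names invented for the example.

```python
import warnings


class QuiesceError(Exception):
    """Hypothetical error: VSS could not quiesce or flush its writers."""


def build_lwd(quiesce_with_vss, capture_changed_blocks):
    """Create an LWD, preferring application consistency.

    If the VSS request fails, carry on regardless, produce an OS consistent
    bundle instead, and raise a warning, as the text describes.
    """
    try:
        quiesce_with_vss()
        consistency = "application"
    except QuiesceError as err:
        warnings.warn(f"VSS consistency could not be created: {err}")
        consistency = "os"
    return {"consistency": consistency, "blocks": capture_changed_blocks()}


def failing_quiesce():
    raise QuiesceError("writer timeout")


with warnings.catch_warnings(record=True) as caught:
    warnings.simplefilter("always")
    lwd = build_lwd(failing_quiesce, lambda: [5, 6])
```

The point of the design is that a failed quiesce never blocks replication; it only downgrades the consistency level of that one bundle.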

All Paths Down Improvements

The way vSphere 5 handles hosts with devices in an “All Paths Down” state has been improved to ensure that the host does not get stuck in a loop attempting I/O on unavailable devices.

APD states often occur during disaster scenarios, and as such it becomes important for SRM that the platform not cause delay for recovery.

SRM now checks for a datastore’s accessibility flag before deciding whether or not to attempt to use that datastore. A datastore may become inaccessible because of various reasons, one of which is APD.

The changes in how vSphere handles these devices enable SRM to differentiate APD from other types of inaccessible states such as Permanent Device Loss (PDL).

If SRM sees a datastore in an APD condition, it will stop immediately and try again later, since APD conditions are supposed to be transient, rather than time out trying to access a missing device.
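A rough Python sketch of that check-then-retry behaviour is shown below. All names are hypothetical, the retry count and delay are arbitrary, and giving up immediately on PDL is an assumption made for the example; the text only describes what happens for APD.

```python
import time
from enum import Enum


class DatastoreState(Enum):
    ACCESSIBLE = "accessible"
    APD = "all_paths_down"
    PDL = "permanent_device_loss"


def wait_until_usable(get_state, max_attempts=5, delay_seconds=30.0, sleep=time.sleep):
    """Return True once the datastore is accessible, False if we give up.

    APD is treated as transient: stop immediately and try again later rather
    than timing out on I/O. PDL is treated as permanent (an assumption here).
    """
    for attempt in range(max_attempts):
        state = get_state()
        if state is DatastoreState.ACCESSIBLE:
            return True
        if state is DatastoreState.PDL:
            return False
        if attempt < max_attempts - 1:
            sleep(delay_seconds)  # APD: back off before checking the flag again
    return False


# Simulated datastore that recovers from APD on the third check.
states = iter([DatastoreState.APD, DatastoreState.APD, DatastoreState.ACCESSIBLE])
pauses = []
usable = wait_until_usable(lambda: next(states), sleep=pauses.append)
```

Passing `sleep` in as a parameter keeps the sketch testable without actually waiting.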

SRM also has been improved to use a new unmount command to gracefully remove datastores from the primary protected site during the execution of a recovery plan. Since SRM needs to break replication and unmount the datastore from the protected environment the new method allows for a graceful dismount and generation of an APD situation rather than an abrupt removal of the datastore.

During a disaster recovery, however, in some cases hosts are inaccessible via network to gracefully unmount datastores, and in the past the isolated hosts could panic if their storage was removed abruptly by SRM.

With vSphere 5.1 there are new improvements to the hosts and storage stacks that allow them to remain operative even through an unplanned APD state.

Forced Failover

Forced failover was introduced in SRM 5.0.1 for recovery plans using array based replication protection groups. With SRM 5.1 forced failover is now fully supported for all protection group types.

In some cases SRM will be unable to handle storage failure scenarios at the protection site. Perhaps the devices have entered an APD or PDL state, or perhaps storage controllers are unavailable, or for many other reasons. Perhaps the original SAN is reduced to a puddle of molten slag.

In these cases, SRM can enter a state where it waits for responses from the storage for an untenable amount of time. For instance, timeouts have been seen to last as long as 8 hours while waiting for responses from ‘misbehaving’ storage at the protected site.

Forced failover handles these scenarios. If storage is in a known inconsistent state, a user may choose to run a recovery plan failover in “forced failover” mode. Alternately, if a recovery plan is failing and timing out due to unresponsive protected site storage, the administrator could cancel the running recovery plan and launch it again in forced failover mode.

Forced failover will run *only* recovery-side operations of the recovery plan. It will not attempt any protected site operations such as storage unmounts or VM shutdowns. During a forced failover execution of a recovery plan any responses generated by the protected site are completely ignored.
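Conceptually, forced failover amounts to filtering the recovery plan down to its recovery-side steps. The Python sketch below illustrates the idea only; the `Step` type and the step names are hypothetical and do not correspond to real SRM plan steps.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Step:
    name: str
    site: str  # "protected" or "recovery"


def steps_to_run(plan, forced):
    """Return the steps a failover would execute.

    In forced mode every protected-site operation is skipped, so nothing
    ever waits on the protected site.
    """
    if not forced:
        return list(plan)
    return [step for step in plan if step.site == "recovery"]


plan = [
    Step("shut down VMs", "protected"),
    Step("unmount datastores", "protected"),
    Step("bring replica storage online", "recovery"),
    Step("power on VMs", "recovery"),
]
forced_names = [step.name for step in steps_to_run(plan, forced=True)]
```

A normal run of the same plan would execute all four steps in order.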

Array-based replication forced failover worked with SRM 5.0.1, and after extensive testing has now been introduced to work with vSphere Replication as well.

Failback supported with both Array and vSphere Replication

SRM 5.1 now includes vSphere Replication in the “automated failback” workflow!

With SRM 5 VMware introduced the “Reprotect” and failback workflows that allowed storage replication to be automatically reversed, protection of VMs to be automatically configured from the “failed over” site back to the “primary site” and thereby allowing a failover to be run that moved the environment back to the original site.

Taken together as “automated failback” this feature was well received by those using array-based replication, but was unavailable for use with vSphere Replication.

With SRM 5.1 users can now do automated reprotects and run failback workflows for recovery plans with any type of protection group, both VR and ABR inclusive.

After running a *planned failover only*, the SRM user can now reprotect back to the primary environment:

Planned failover shuts down production VMs at the protected site cleanly, and disables their use via GUI. This ensures the VM is a static object and not powered on or running, which is why we have the requirement for planned migration to fully automate the process.

The “Reprotect” button when used with VR will now issue a request to the VR Appliance (VRMS in SRM 5.0 terminology) to configure replication in the opposite direction.

When this takes place, VR will reuse the same settings that were configured for initial replication from the primary site (RPO, which directory, quiescence values, etc.) and will use the old production VMDK as seed target automatically.

VR now begins to replicate back to the primary disk file originally used as the production VM before failover.

If things have gone wrong at the primary site and an automatic reprotect is not possible due to missing or bad data at the original site, VR can be manually configured, and when the “Reprotect” is issued SRM will automatically use the manually configured VR settings to update the protection group.

Once the reprotect is complete a failback is simply the process of running the recovery plan that was used to failover initially.

vSphere Essentials Plus Support

SRM 5.1 is now supported with vSphere Essentials Plus, enabling smaller companies to move towards reliable disaster recovery protection for their sites.

•vCenter version 5.1 is the only version that will work with SRM 5.1. Lower versions of vSphere/VI are supported, but vCenter must be up to date.

•At time of shipping, only vSphere 4.x and 5.x are supported.

•ONLY ESXi 5.0 and 5.1 will work for vSphere Replication as the VR Agent is a component of the ESXi 5.x hypervisor.

•While neither Storage DRS nor Storage vMotion is supported with SRM 5.1, they will work in some scenarios even though unsupported.

•While Storage vMotion with array-replicated protected VMs can be done by an administrator, they must then ensure that the target datastore is replicated and that the virtual machine is once again configured for protection. Because this is a very manual process it is not officially supported.

•Storage DRS compounds this problem by automating Storage vMotion, and thereby will cause the VMDK of the protected virtual machines to migrate to potentially unprotected storage. Because of this it is unsupported with SRM 5.

•Storage vMotion and Storage DRS are not supported at all with SRM 5 using vSphere Replication as migration of a VMDK will cause the migrated VM to reconfigure itself for protection, potentially putting it in violation of its recovery point objective.